Implement AI-based automatic identification of sensitive columns in databases#4495
Merged
Conversation
feat(ai_recognition): Implement rule priority configuration using the strategy pattern feat(ai_recognition): Implement the process of sending sensitive columns in batches to the AI feat(ai_recognition): Implement the process of sending sensitive columns in batches to the AI feat(ai_recognition): Implement the process of sending sensitive columns in batches to the AI(completely restructured version)
fix(ai_recognition): 解决查询接口缺少AI相关字段 fix(ai_recognition): 去除AI识别器冗余逻辑 feature(ai_recognition): 为扫描任务增加多线程和异步 feature(ai_recognition): 去除置信度阈值功能 feature(ai_recognition): 完善为AI识别结果指定默认算法功能 feature(ai_recognition): 中断扫描功能 feature(ai_recognition): 优化提示词拼接逻辑 feature(ai_recognition): 优化AI调用代码 feature(ai_recognition): 删除冗余识别模式 feature(ai_recognition): 优化提示词
…or passive scanning and refactor the code in the sole AI mode
feature(ai_recognition): 集成测试
fix(ai_recognition):AI状态查询bug修复
fix(ai_recognition):代码格式化和集成测试
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What type of PR is this?
type-feature
module-datasecurity
What this PR does / why we need it:
This PR introduces an intelligent, AI-based feature for automatically identifying sensitive columns in databases, addressing the limitations of the current manual, rule-based system. It aims to significantly improve both the accuracy and efficiency of data sensitivity discovery.
Implementation Overview
SELECTquery, providing proactive data security insights.ODC_APP_EXTRA_ARGS), including the API key, endpoint, and model name. The UI will reflect the service's availability based on these settings.localStoragecache.Which issue(s) this PR fixes:
Fixes #4489
Special notes for your reviewer:
Testing Suggestions
Please pay special attention to the following key areas as detailed in the test plan:
1. AI Configuration & Availability:
ODC_APP_EXTRA_ARGS.2. Passive Scanning Triggers:
SELECT * FROM ...query, verify the same "Scanning..." indicator appears above the results, followed by the outcome.3. Caching Behavior:
4. Scan Mode Logic: