[improve/#137] RSS 크롤링 성능 및 안정성 개선 #138
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
❤️ 기능 설명
주요 개선 사항
1. 중복 처리 최적화 (성능 50% 개선: 28초 → 12초)
existsByUrl()개별 조회 (N번의 SELECT)PostRepository.findExistingUrls()메서드 추가로 배치 중복 체크 구현2. 병렬 처리 방식 개선
parallelStream()으로 테크 블로그마다 병렬 RSS 수집ConcurrentLinkedQueue→ 단순List사용으로 코드 간소화3. 좀비 프로세스 정리 최적화
4. 보안 강화
InsecureTrustManagerFactory코드 (MITM 공격 위험)5. 쿼리 최적화
TechBlog조회 시getReferenceById()사용으로 프록시 객체만 가져오기성능 측정 결과
연결된 issue
연결된 issue를 자동으로 닫기 위해 아래 {이슈넘버}를 입력해주세요.
close #137
🩷 Approve 하기 전 확인해주세요!
✅ 체크리스트