PowerPoint Content Extractor and Summarizer
A Python-based tool that extracts text and images from PowerPoint presentations and generates intelligent summaries using natural language processing techniques.
Features
- Extract text content from PowerPoint slides
- Extract and save images from slides
- Generate concise summaries using both frequency-based and transformer-based approaches
- Intelligent error handling for corrupted images or content
- Support for batch processing of slides
Requirements
python >= 3.6
python-pptx
Pillow
nltk
transformers
torch