Coconut Water Sentiment Analysis

An AI-Powered Pipeline for Historical Review Classification (1999–2012)

📌 Project Overview

This project is a high-precision sentiment analysis pipeline designed to categorize customer feedback for a coconut water brand. Using OpenAI's GPT-4o-mini, the system transforms raw JSON review data into sentiment labels (positive, negative, neutral, irrelevant).

The pipeline utilizes a 'Ralph Wiggum' agentic workflow (seen in label.py); this pattern ensures that the classification logic and API calls are autonomously iterated upon until they satisfy all rigorous unit tests before deployment.

Implementing the Ralph Wiggum pattern:

"I'm helping!" - Keep looping until tests pass.

while not tests_passed:
    rerun_sentiment_analysis()

Technical Architecture

The pipeline is modularized into three main components:

label.py (AI Engine): Interfaces with the OpenAI API using advanced prompt engineering. It features robust input validation to handle data-type anomalies and empty datasets.
visualize.py: Aggregates sentiment distribution and generates a simple pie chart, automatically exporting them to a dedicated images/ directory.
main.py (Pipeline Orchestration): The "brain" of the project that handles file I/O and executes the end-to-end flow from raw JSON to final classification.

Fig 1. Output from one execution of the visualize.py script.

Engineering Challenges & Solutions

1. Advanced Prompt Engineering

Instead of basic queries, I implemented a System-Prompt strategy that provides the LLM with cultural context and specific examples of nuanced sentiment. This ensures that a review like "its a ring" is correctly identified as irrelevant rather than neutral.

2. Test-Driven Development (TDD)

To ensure long-term maintainability, the project includes a comprehensive suite of automated tests (test_*.py). These verify:

API response consistency.
Correct visualization output formatting.
Error handling for "Wrong input" scenarios.

Getting Started

Prerequisites

Python 3.10+
OpenAI API Key (Stored securely via environment variables)

Installation & Execution

Clone the repository: git clone https://github.com/your-username/sentiment-pipeline.git
Install dependencies: pip install -r requirements.txt
Run the core pipeline: python main.py
Execute tests: python test_run.py

📁 Repository Structure

├── images/             # Generated sentiment distribution plots
├── reviews.json        # Source dataset (Coconut water reviews 1999-2012)
├── label.py            # GPT-4o-mini integration logic
├── visualize.py        # Data visualization module
├── main.py             # Pipeline entry point
├── writeup.md          # Qualitative analysis of results
└── .gitignore          # Safeguards for API keys and data artifacts

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
images		images
.gitignore		.gitignore
README.md		README.md
config.py		config.py
label.py		label.py
main.py		main.py
test_label.py		test_label.py
test_package.py		test_package.py
test_run.py		test_run.py
test_visualize.py		test_visualize.py
visualize.py		visualize.py
writeup.md		writeup.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Coconut Water Sentiment Analysis

📌 Project Overview

Implementing the Ralph Wiggum pattern:

Technical Architecture

Engineering Challenges & Solutions

1. Advanced Prompt Engineering

2. Test-Driven Development (TDD)

Getting Started

Prerequisites

Installation & Execution

📁 Repository Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Coconut Water Sentiment Analysis

📌 Project Overview

Implementing the Ralph Wiggum pattern:

Technical Architecture

Engineering Challenges & Solutions

1. Advanced Prompt Engineering

2. Test-Driven Development (TDD)

Getting Started

Prerequisites

Installation & Execution

📁 Repository Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages