Text Analyzer API

A robust Node.js/TypeScript API for analyzing text content with asynchronous processing and comprehensive monitoring capabilities.

Features

Text Analysis: Automatically analyzes text for word count, character count, sentences, paragraphs, and longest words per paragraph
Asynchronous Processing: Uses event-driven architecture for non-blocking text analysis
MongoDB Integration: Persistent storage with Mongoose ODM
Comprehensive API: RESTful endpoints with proper validation and error handling
Pagination Support: Configurable pagination for listing texts
Log Monitoring: Integrated Grafana + Loki + Promtail stack for log visualization
TypeScript: Fully typed codebase for better development experience
Docker Support: Complete containerized setup for monitoring stack

Project Architecture

Core Components

src/
├── app.ts                    # Express application setup
├── server.ts                 # Server entry point with database connection
├── config/
│   ├── config.ts            # Application configuration
│   └── database.ts          # MongoDB connection setup
├── controllers/
│   └── textController.ts    # Text CRUD operations
├── routes/
│   └── textRoutes.ts        # API route definitions
├── models/
│   └── Text.ts              # MongoDB schema definition
├── middlewares/
│   ├── errorHandler.ts      # Global error handling
│   ├── pagination.ts        # Pagination middleware
│   └── validation.ts        # Request validation middleware
├── events/
│   └── textProcessor.ts     # Asynchronous text processing
├── helpers/
│   └── text.ts              # Text analysis utilities
├── validations/
│   └── textValidation.ts    # Joi validation schemas
└── types/
    ├── index.ts             # Main TypeScript type definitions
    └── pagination.ts        # Pagination-specific types

Technology Stack

Runtime: Node.js 22+
Language: TypeScript
Framework: Express.js
Database: MongoDB with Mongoose
Validation: Joi
Testing: Jest
Monitoring: Grafana + Loki + Promtail
Process Management: Event-driven architecture

Getting Started

Prerequisites

Node.js 22 or higher
MongoDB running locally or remote connection
Docker and Docker Compose (for monitoring stack)

Installation

Clone the repository

git clone https://github.com/arifmahmudrana/node-text-analyzer.git
cd node-text-analyzer

Install dependencies
```
npm install
```

Environment setup

cp .env.example .env

Configure your .env file:

PORT=3000
NODE_ENV=development
MONGODB_URI=mongodb://localhost:27017/textapi

Running the Application

Development Mode

# Run with auto-reload
npm run dev

# Run with logging
npm run dev:log

Production Mode

# Build and run
npm run build
npm start

# Run with logging
npm run start:log

# Production with environment variable
npm run start:prod

Monitoring Stack (Optional)

Start the complete monitoring stack with Docker Compose:

docker-compose up --build

This will start:

Loki: Log aggregation service (port 3100)
Grafana: Log visualization dashboard (port 3001)
Promtail: Log collector

Access Grafana at http://localhost:3001 (no authentication required in development).

API Documentation

Base URL

http://localhost:3000/api

Health Check

GET /health

Check API status and environment information.

Response:

{
  "success": true,
  "message": "API is running successfully",
  "data": {
    "timestamp": "2025-06-02T10:30:00.000Z",
    "environment": "development"
  }
}

Text Operations

POST /api/texts

Create a new text for analysis.

Request Body:

{
  "text": "Your text content here. This can be multiple paragraphs with various sentences."
}

Response:

{
  "success": true,
  "message": "Text created successfully",
  "data": {
    "_id": "60d5f484f1b2c8b1f8e4e1a1",
    "text": "Your text content here...",
    "done": false,
    "numberOfWords": 0,
    "numberOfCharacters": 0,
    "numberOfSentences": 0,
    "numberOfParagraphs": 0,
    "longestWordsInParagraphs": [],
    "createdAt": "2025-06-02T10:30:00.000Z",
    "updatedAt": "2025-06-02T10:30:00.000Z"
  }
}

Note: The text analysis runs asynchronously. The done field will be false initially and true once processing is complete.

GET /api/texts/:id

Retrieve a specific text by ID (only returns completed analyses).

Parameters:

id (string): MongoDB ObjectId of the text

Response:

{
  "success": true,
  "message": "Text retrieved successfully",
  "data": {
    "_id": "60d5f484f1b2c8b1f8e4e1a1",
    "text": "Your text content here...",
    "done": true,
    "numberOfWords": 45,
    "numberOfCharacters": 234,
    "numberOfSentences": 3,
    "numberOfParagraphs": 2,
    "longestWordsInParagraphs": ["paragraph", "sentences"],
    "createdAt": "2025-06-02T10:30:00.000Z",
    "updatedAt": "2025-06-02T10:30:15.000Z"
  }
}

Error Response (404):

{
  "success": false,
  "message": "Text not found"
}

GET /api/texts

List texts with pagination and filtering.

Query Parameters:

done (optional): Filter by processing status
- true: Only completed analyses
- false: Only pending analyses
- all: All texts (default)
page (optional): Page number (default: 1)
limit (optional): Items per page (default: 10, max: 100)
order (optional): Sort order - asc or desc (default: desc)
orderBy (optional): Sort field - createdAt or updatedAt (default: createdAt)

Example Requests:

# Get all texts (first page)
GET /api/texts

# Get completed texts only
GET /api/texts?done=true

# Get second page with 20 items
GET /api/texts?page=2&limit=20

# Get texts ordered by update time (ascending)
GET /api/texts?orderBy=updatedAt&order=asc

# Combined filters
GET /api/texts?done=true&page=1&limit=50&orderBy=createdAt&order=desc

Response:

{
  "success": true,
  "message": "Texts retrieved successfully",
  "data": [
    {
      "_id": "60d5f484f1b2c8b1f8e4e1a1",
      "text": "Sample text...",
      "done": true,
      "numberOfWords": 45,
      "numberOfCharacters": 234,
      "numberOfSentences": 3,
      "numberOfParagraphs": 2,
      "longestWordsInParagraphs": ["paragraph", "sentences"],
      "createdAt": "2025-06-02T10:30:00.000Z",
      "updatedAt": "2025-06-02T10:30:15.000Z"
    }
  ],
  "meta": {
    "currentPage": 1,
    "totalPages": 5,
    "totalItems": 47,
    "itemsPerPage": 10,
    "hasNextPage": true,
    "hasPreviousPage": false
  }
}

Text Analysis Details

The API analyzes text content and provides the following metrics:

numberOfWords: Total word count (splits on whitespace)
numberOfCharacters: Total character count including spaces
numberOfSentences: Count of sentences (splits on ., !, ?)
numberOfParagraphs: Count of paragraphs (splits on double newlines)
longestWordsInParagraphs: Array of longest words from each paragraph

Analysis Process

Text is saved immediately with done: false
Asynchronous event is triggered for processing
Text analysis runs in the background
Document is updated with results and done: true
Completed analysis is available via API

Development

Available Scripts

# Development
npm run dev              # Start with nodemon
npm run dev:log          # Start with logging

# Production
npm run build            # Compile TypeScript
npm start                # Start compiled server
npm run start:log        # Start with logging
npm run start:prod       # Production with logging

# Testing
npm test                 # Run Jest tests
npm run test:watch       # Run tests in watch mode

# Code Quality
npm run lint             # Run ESLint

VS Code Debugging

The project includes VS Code launch configurations:

Debug Server (ts-node): Debug the main server
Debug Jest Tests: Debug all tests
Jest Current File: Debug current test file

Project Structure Details

Configuration (`src/config/`)

config.ts: Centralized configuration management
database.ts: MongoDB connection with error handling

Models (`src/models/`)

Text.ts: Mongoose schema for text documents

Controllers (`src/controllers/`)

textController.ts: Business logic for text operations

Middlewares (`src/middlewares/`)

errorHandler.ts: Global error handling
pagination.ts: Reusable pagination logic
validation.ts: Request validation middleware using Joi schemas

Events (`src/events/`)

textProcessor.ts: Event-driven text analysis processor

Helpers (`src/helpers/`)

text.ts: Pure functions for text analysis

Validations (`src/validations/`)

textValidation.ts: Joi schemas for request validation including createTextSchema

Types (`src/types/`)

index.ts: Main TypeScript type definitions
pagination.ts: Pagination-specific type definitions

Log Monitoring

Grafana Dashboard

Start monitoring stack: docker-compose up --build
Access Grafana: http://localhost:3001
Loki datasource is pre-configured
View application logs in real-time

Log Locations

Development logs: logs/app-dev.log
Production logs: logs/app-prod.log

Error Handling

The API includes error handling:

Validation Errors: Proper validation with Joi
Database Errors: MongoDB connection and query errors
Not Found: 404 responses for missing resources
Server Errors: 500 responses with error details
Graceful Shutdown: Proper cleanup on process termination

Performance Considerations

Asynchronous Processing: Text analysis doesn't block API responses
Database Indexes: Optimized queries with proper indexing
Lean Queries: Using lean() for better performance
Pagination: Efficient data retrieval with limits
Event-driven Architecture: Non-blocking processing pipeline

Environment Variables

PORT=3000                                    # Server port
NODE_ENV=development                         # Environment (development/production)
MONGODB_URI=mongodb://localhost:27017/textapi  # MongoDB connection string e.g. mongodb://root:123@localhost:27027/textapi?authSource=admin here root is username 123 is password and authSource admin set for authentication

Testing

Run the test suite:

npm test

The project uses Jest for testing with configurations for:

Unit tests for individual functions
Mocking for external dependencies

Run using a k8s cluster

If you are using arm architecture:

./k8s-local-dev-up.sh # to up the environment it will spin up MongoDB, PLG stack and create deployment all in a separate namespace

./k8s-local-dev-down.sh # to cleanup everything

If you are using a different architecture or then update the k8s/mongodb-values.yaml file image option and you can change the image tag in the k8s/app/app-deployment.yaml file and adjust.

Contributing

Fork the repository
Create a feature branch: git checkout -b feature-name
Make your changes
Add tests for new functionality
Run tests: npm test
Run linting: npm run lint
Commit changes: git commit -am 'Add feature'
Push to branch: git push origin feature-name
Submit a pull request

License

BSD 3-Clause License - see the LICENSE file for details.

Support

For issues and questions:

Open an issue on GitHub
Check existing documentation
Review the code examples above

Happy analyzing! 📊📝

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
.github/workflows		.github/workflows
.vscode		.vscode
k8s		k8s
logs		logs
src		src
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.prettierrc		.prettierrc
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
eslint.config.js		eslint.config.js
grafana-datasource.yml		grafana-datasource.yml
jest.config.js		jest.config.js
k8s-local-demo.gif		k8s-local-demo.gif
k8s-local-dev-down.sh		k8s-local-dev-down.sh
k8s-local-dev-up.sh		k8s-local-dev-up.sh
package-lock.json		package-lock.json
package.json		package.json
promtail-config.yaml		promtail-config.yaml
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

Text Analyzer API

Features

Project Architecture

Core Components

Technology Stack

Getting Started

Prerequisites

Installation

Running the Application

Development Mode

Production Mode

Monitoring Stack (Optional)

API Documentation

Base URL

Health Check

GET /health

Text Operations

POST /api/texts

GET /api/texts/:id

GET /api/texts

Text Analysis Details

Analysis Process

Development

Available Scripts

VS Code Debugging

Project Structure Details

Configuration (src/config/)

Models (src/models/)

Controllers (src/controllers/)

Middlewares (src/middlewares/)

Events (src/events/)

Helpers (src/helpers/)

Validations (src/validations/)

Types (src/types/)

Log Monitoring

Grafana Dashboard

Log Locations

Error Handling

Performance Considerations

Environment Variables

Testing

Run using a k8s cluster

Contributing

License

Support

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Configuration (`src/config/`)

Models (`src/models/`)

Controllers (`src/controllers/`)

Middlewares (`src/middlewares/`)

Events (`src/events/`)

Helpers (`src/helpers/`)

Validations (`src/validations/`)

Types (`src/types/`)

Packages