Skip to content

ibadrajput7/Jarvis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🧠 Jarvis

AI Voice Assistant & Automation System

Talk, command, and automate your system using natural voice interaction powered by AI.


πŸš€ Overview

Jarvis is an AI-powered voice assistant that allows users to interact with their system using natural language and voice commands.

It acts like a personal AI companion β€” similar to Siri or Alexa β€” but enhanced with modern LLM intelligence and automation capabilities.

Jarvis can:

  • πŸ’¬ Answer questions using AI
  • πŸŽ™οΈ Understand voice commands
  • πŸ”Š Respond using natural AI voice
  • βš™οΈ Automate system tasks
  • 🌐 Open apps and websites
  • 🎧 Provide conversational assistance in real-time

✨ Features

πŸŽ™οΈ Voice Interaction System

  • Real-time speech-to-text (Speech Recognition)
  • Natural voice conversation
  • Hands-free AI interaction

🧠 AI Intelligence Engine

  • Powered by OpenAI API
  • Context-aware conversations
  • Smart reasoning and explanations

πŸ”Š AI Voice Response System

  • Text-to-speech using ElevenLabs
  • Natural human-like voice output
  • Personalized assistant voice

βš™οΈ System Automation

  • Open applications (Chrome, apps, tools)
  • Control system commands
  • Volume up / down control
  • Execute predefined system tasks

πŸ’¬ Chat + Voice Hybrid Mode

  • Switch between typing and speaking
  • Seamless interaction modes
  • Persistent conversation flow

πŸ” Authentication System

  • User signup & login
  • Secure session management
  • Personalized assistant per user

πŸ— Architecture

graph TD
    A[React Frontend] --> B[FastAPI Backend]
    B --> C[OpenAI API]
    B --> D[MySQL Database]
    B --> E[Speech Recognition Engine]
    B --> F[ElevenLabs TTS]
    B --> G[System Automation Layer]
Loading

🧰 Tech Stack

Layer Technologies
Backend Python, FastAPI, MySQL
Frontend React.js
AI Engine OpenAI API
Voice Output ElevenLabs API (TTS)
Speech Input SpeechRecognition API
Automation OS-level command execution
DevOps Docker (optional)

πŸš€ Getting Started

1. Clone the repository

git clone https://github.com/your-username/jarvis.git
cd jarvis

2. Set up environment variables

Create a .env file in the project root:

DATABASE_URL=
OPENAI_API_KEY=
ELEVENLABS_API_KEY=

JWT_SECRET=

3. Run the project

docker compose up --build

The app should now be running locally via Docker.


πŸ“Š How It Works

  1. User logs in to the Jarvis system
  2. User speaks or types a command
  3. Speech is converted into text
  4. Backend sends the request to OpenAI
  5. AI generates an intelligent response
  6. Response is converted to speech using ElevenLabs
  7. Optional system command is executed
  8. Response is shown and spoken back to the user

🎯 Example Commands

πŸ’¬ AI Chat

  • "What is artificial intelligence?"
  • "Explain quantum computing simply"
  • "Write a Python function for sorting"

βš™οΈ System Automation

  • "Open Chrome"
  • "Increase volume"
  • "Open YouTube"
  • "Minimize window"

🚧 Roadmap

  • Voice-based AI assistant
  • Speech-to-text integration
  • AI chat system
  • Text-to-speech responses
  • Wake-word detection ("Hey Jarvis")
  • Desktop app version
  • Mobile integration
  • Smart home integration
  • Memory-based conversations

πŸ“‚ Project Structure

jarvis/
β”œβ”€β”€ backend/        # FastAPI backend (AI + logic + automation)
β”œβ”€β”€ frontend/        # React UI
β”œβ”€β”€ services/        # Speech, AI, and voice services
β”œβ”€β”€ automation/      # System command execution layer
β”œβ”€β”€ docker/          # Docker setup
└── README.md

πŸ’‘ Why Jarvis?

Jarvis helps you:

  • 🧠 Interact with AI using natural voice
  • ⚑ Automate repetitive system tasks
  • πŸŽ™οΈ Control your system hands-free
  • πŸ”Š Get human-like AI responses
  • πŸ’¬ Improve productivity through voice-first interaction

It's a next-generation AI voice assistant system designed for productivity and automation.


πŸ“œ License

This project is licensed under the MIT License.


πŸ“« Contact


⭐ If you like Jarvis, consider giving this repository a star!

About

Jarvis is an AI-powered voice assistant that enables natural conversation and system automation, allowing users to control tasks, ask questions, and interact with their device using voice commands.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors