SuperMuel/VoxMac


VoxMac

A native macOS app for speech-to-text transcription. Record audio with a global keyboard shortcut, transcribe it using OpenAI Whisper or Mistral Voxtral, and insert text into any active app. Features a minimal menu bar UI, customizable settings, and transcription history.

Why?

Because I didn't find any existing app that supports Mistral's Voxtral 🤷‍♂️

Features

  • Global Keyboard Shortcut: Toggle audio recording with a customizable hotkey.

  • Speech-to-Text Transcription: Supports OpenAI Whisper and Mistral Voxtral APIs.

  • Text Insertion: Automatically inserts transcribed text at the cursor position using Accessibility APIs.

  • Menu Bar Interface: Displays recording status and provides quick access to settings and history.

  • Settings: Configure API keys, transcription service, insertion method, and keyboard shortcut.

  • Transcription History: View, copy, or delete past transcriptions with search functionality.

  • Onboarding: Guides users through initial setup, including microphone and accessibility permissions.
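
As an illustration of the transcription step, here is a minimal sketch of building a request for OpenAI's `/v1/audio/transcriptions` endpoint, which accepts a multipart form with `model` and `file` fields. The function name `makeTranscriptionRequest` and its parameters are hypothetical and not taken from the VoxMac source; the app's actual networking code may look quite different.

```swift
import Foundation

// Hypothetical helper for illustration only; not the code VoxMac actually uses.
// Builds a multipart/form-data POST for OpenAI's audio transcription endpoint.
func makeTranscriptionRequest(audio: Data,
                              filename: String,
                              apiKey: String,
                              model: String = "whisper-1") -> URLRequest {
    let boundary = "Boundary-\(UUID().uuidString)"
    var request = URLRequest(url: URL(string: "https://api.openai.com/v1/audio/transcriptions")!)
    request.httpMethod = "POST"
    request.setValue("Bearer \(apiKey)", forHTTPHeaderField: "Authorization")
    request.setValue("multipart/form-data; boundary=\(boundary)",
                     forHTTPHeaderField: "Content-Type")

    var body = Data()
    // Text field: the model name.
    body.append(Data("--\(boundary)\r\n".utf8))
    body.append(Data("Content-Disposition: form-data; name=\"model\"\r\n\r\n".utf8))
    body.append(Data("\(model)\r\n".utf8))
    // File field: the recorded audio bytes.
    body.append(Data("--\(boundary)\r\n".utf8))
    body.append(Data("Content-Disposition: form-data; name=\"file\"; filename=\"\(filename)\"\r\n".utf8))
    body.append(Data("Content-Type: application/octet-stream\r\n\r\n".utf8))
    body.append(audio)
    // Closing boundary marks the end of the form.
    body.append(Data("\r\n--\(boundary)--\r\n".utf8))

    request.httpBody = body
    return request
}
```

The resulting request could then be sent with `URLSession.shared.data(for:)`. A Voxtral request would presumably need a different URL and API key, which this sketch does not cover.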

Screenshots

[History screenshot] [Settings screenshot]

Related

TODO

  • Allow model selection (not just provider). Useful for gpt-4o-transcribe or gpt-4o-mini-transcribe.
  • Add ElevenLabs Scribe.
  • Allow custom vocabulary or a custom prompt for eligible models.
  • Queue service for async processing of transcription requests when offline.

Warning

This app is 90% vibe coded. I had never coded in Swift before.

Logging

  • Uses Apple's Unified Logging with os.Logger via a small helper in VoxMac/Log.swift.
  • Categories: app, audio, transcription, history, database, permissions, notifications, ui, insertion, network, keychain.
  • View logs in Console.app: search for the app’s bundle identifier and filter by category/level.
  • Debug messages are verbose and best viewed during development; info/notice/error/fault are useful in production builds.
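
For readers unfamiliar with `os.Logger`, a helper of the kind described above might look roughly like the sketch below. This is an assumption about the shape of `VoxMac/Log.swift`, not a copy of it: the `Log` enum, its property names, and the fallback subsystem string `com.example.VoxMac` are all hypothetical, and only a few of the listed categories are shown.

```swift
import Foundation
import os

// Hypothetical sketch; the real VoxMac/Log.swift may be organized differently.
enum Log {
    // Use the bundle identifier as the subsystem so Console.app can filter on it.
    // The fallback string is a placeholder for contexts without a bundle identifier.
    private static let subsystem = Bundle.main.bundleIdentifier ?? "com.example.VoxMac"

    // One Logger per category.
    static let app = Logger(subsystem: subsystem, category: "app")
    static let audio = Logger(subsystem: subsystem, category: "audio")
    static let transcription = Logger(subsystem: subsystem, category: "transcription")
    static let network = Logger(subsystem: subsystem, category: "network")
}

// Example call sites at different levels.
let characterCount = 42
let failureReason = "timeout"
Log.audio.debug("Recording started")
Log.transcription.info("Transcription finished: \(characterCount, privacy: .public) characters")
Log.network.error("Request failed: \(failureReason, privacy: .public)")
```

Interpolated strings are redacted in logs by default unless marked `privacy: .public`, so anything sensitive such as API keys or transcribed text is best left at the default.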
