Whisper Input / Qingyu — OpenLess-based Windows Voice Typing

English | 中文

Whisper Input / Qingyu — OpenLess-based Windows Voice Typing

An OpenLess-based Windows AI voice input tool for Chinese workplace writing: hold a global hotkey, speak naturally, turn speech to text, remove filler words, polish or structure the result, and insert it at the current cursor position.

Search intent: OpenLess alternative, Typeless alternative, typeless-alternative, Windows voice typing, AI voice dictation, Chinese speech-to-text, workplace dictation, Chinese-to-English voice input.

中文搜索意图：OpenLess 改造版、Typeless 平替、开源 Typeless、Windows 语音输入、快捷键语音转文字、职场语音输入、中文转英文语音输入。

Download latest Windows installer

🎯 At a Glance

Whisper Input is not a traditional IME, nor a meeting transcription tool.

It does one thing: press a shortcut key, speak, and it turns your spoken words into natural, well-structured text at your cursor position. If direct insertion fails, the result is copied to the clipboard as a fallback.

This project is built upon OpenLess, but it is not an official OpenLess distribution and is not affiliated with Typeless. It explores a Windows-first, cloud-first, BYOK direction for people searching for an open-source Typeless-style voice typing workflow in Chinese workplace scenarios.

Here are some typical scenarios:

Scenario	What you say	What Whisper Input produces
💬 Everyday chat	"Just handle this requirement like this for now, I will fill in the details tomorrow"	Clear, natural chat text with fewer filler words
🧑‍💼 Reporting to your boss	"Boss, there are three meetings this week, which one works for you?"	A formal request / meeting invitation
🧱 Task breakdown	"First push the code, second update the README, third publish the installer"	`1.`, `1.1`, `2.` structured text
🌐 English output	Dictate email content in Chinese	English email / Issue / work document
🔢 Number formatting	"Three yuan twenty-eight, tomorrow at two in the afternoon"	`3.28 yuan`, `Tomorrow 14:00`

🧭 How It Works

flowchart LR
  A["⌨️ Global Shortcut"] --> B["🎙️ Start Recording"]
  B --> C["☁️ Cloud Real-time ASR"]
  C --> D["✨ LLM Refinement / Polish"]
  D --> E["📍 Insert at Cursor"]
  E --> F{"Insertion Successful?"}
  F -->|Yes| G["✅ Done"]
  F -->|No| H["📋 Copy to Clipboard"]
  H --> I["🕘 Save to History"]

No need to switch input methods, open a chat window, or copy and paste manually.
Wherever your cursor is, the transcribed text appears there; if insertion fails, it is automatically copied to the clipboard as a fallback.

✨ Core Capabilities

Icon	Capability	Problem It Solves
🎙️	Chinese voice input	Designed for real work scenarios—primarily Chinese with occasional English terms
⚡	Low-latency pipeline	Recognizes, polishes, and inserts as quickly as possible after recording ends
🧹	Light polish	Removes "uh", "um", "like", filler words, repetitions, and obvious speech errors
🧱	Clear structure	Organizes multiple points from spoken text into hierarchical numbering
🧑‍💼	Formal expression	Converts to emails, requests, feedback, handoff documents, and other formal text
🌐	Chinese to English	Speak in Chinese, get English work text as output
🔢	Format normalization	Automatically formats currency, time, numbers, and corrects spacing and punctuation
📚	User dictionary	Preserves names, company names, product names, and technical terms
🕘	History	Local review, copy, and delete recent inputs
📋	Clipboard fallback	Ensures you get the result even if insertion into the input field fails

🧩 Four Output Styles

flowchart TB
  A["The same spoken text"] --> B["📝 Original"]
  A --> C["🧹 Light Polish"]
  A --> D["🧱 Clear Structure"]
  A --> E["🧑‍💼 Formal"]
  B --> B1["Keeps original wording, only adds sentence breaks and punctuation"]
  C --> C1["Removes filler words and repetitions, maintains original order"]
  D --> D1["Organizes into 1 / 1.1 / 2 hierarchical structure"]
  E --> E1["Converts to formal email, request, feedback, or work document"]

Original Dictation

Hey boss about that project acceptance I was wrong earlier it is not Tuesday it is Wednesday at two in the afternoon and also please check the contract and payment milestones and the testing part needs some changes too.

📝 Original

Hey boss, about the project acceptance—I was wrong earlier, it is not Tuesday, it is Wednesday at two in the afternoon. Also, please check the contract and payment milestones, and the testing part needs some changes too.

🧹 Light Polish

Boss, regarding the project acceptance—I was wrong earlier. It is not Tuesday, but Wednesday at two in the afternoon. Please check the contract and payment milestones. The testing part also needs some changes.

🧱 Clear Structure

Boss, regarding the project acceptance, the following items need to be adjusted:

1. Time Correction
1.1 The project acceptance is not on Tuesday, but on Wednesday at 2:00 PM.

2. Items Requiring Confirmation
2.1 Please review the contract and payment milestones.
2.2 The testing section needs adjustments.

🧑‍💼 Formal

Dear Boss,

Regarding the project acceptance, I would like to update you on the following:

1. Time Correction
The project acceptance date was previously stated incorrectly. It has been corrected to Wednesday at 2:00 PM.

2. Items Requiring Confirmation
Please review the contract and payment milestones. Additionally, the testing section requires further adjustments.

Thank you.

🧑‍💼 Formal Expression Example: Meeting Invitation

You can dictate naturally, just as you would speak:

Hello Director Li, there are three meetings this week: tomorrow is the Jiangsu Provincial Annual Conference, Thursday is the Chang'an Studies Forum, and Friday is the Factory Recruitment Fair. The locations are Jinan, Tai'an, and Xinjiang respectively. Which one would be convenient for you to attend? I would like to invite you to one of the meetings. Thank you.

The Formal mode would produce output more like this:

Dear Director Li,

I would like to request your attendance at one of this week's meetings as follows.

1. Meeting Schedule
1.1 Jiangsu Provincial Annual Conference: Tomorrow, in Jinan.
1.2 Chang'an Studies Forum: Thursday, in Tai'an.
1.3 Factory Recruitment Fair: Friday, in Xinjiang.

2. Request
We sincerely invite you to attend and provide guidance at one of these meetings. Please let us know which meeting fits your schedule this week.

Thank you.

It will not fabricate background information or expand on facts you did not mention. The focus is on organizing what you have already said into a format more suitable for professional communication.

🌐 Speak Chinese, Output English

flowchart LR
  A["🗣️ Chinese Dictation"] --> B["🧠 Understand Intent"]
  B --> C["🌐 English Expression"]
  C --> D["📍 Insert into Email / Issue / Document"]

You say:

Help me write something in English saying we have completed this update. The main fix was for the issue where long voice input text would get truncated, and we also improved the Formal expression mode.

Output:

We have completed this update. The main changes include fixing the issue where long voice input could be truncated, and improving the Formal style so that spoken content is converted into a more structured and professional format.

No need to think in English while typing, or write Chinese first and then copy it into a translation tool.

🔐 Data & Privacy

Whisper Input is a cloud-first product, not an offline ASR tool. You need to configure your own cloud ASR and LLM API keys.

flowchart TB
  A["🎙️ Your Recording"] --> B["☁️ Your Configured ASR Service"]
  B --> C["📝 ASR Text"]
  C --> D["☁️ Your Configured LLM Service"]
  D --> E["✨ Polished Text"]
  E --> F["💻 Current App Input Field"]
  E --> G["🕘 Local History"]
  H["📚 User Dictionary"] --> D
  I["🔑 API Key Config"] --> B
  I --> D

Data	Default Location / Destination
🎙️ Audio Recording	Sent to your configured cloud ASR service
📝 ASR Text	Sent to your configured LLM service
🕘 History	Saved locally by default
📚 User Dictionary	Saved locally by default
🔑 API Key	Stored in local config, can be cleared

You can clear history, dictionary, and API configuration from the settings.

⚙️ Recommended Configuration

Type	Recommendation	Notes
🎙️ Default ASR	Qwen Real-time ASR	Better overall Chinese performance and lower latency after stopping speech
🎙️ Backup ASR	Doubao Streaming Speech Recognition 2.0	Can serve as a backup pipeline
✨ Default LLM	Qwen / Gemini / Doubao	Choose based on region, cost, and availability
⚡ Low-cost mode	Lightweight LLM	Suitable for high-frequency daily input

The settings interface includes built-in common models and API endpoints. Regular users just need to select a service provider and enter their API key.

💰 Why It's Low Cost

Whisper Input uses your own API keys—no expensive subscriptions required.

flowchart LR
  A["Subscription Voice Input"] --> B["Fixed Monthly Fee"]
  C["Whisper Input"] --> D["Pay per actual ASR duration and LLM usage"]
  D --> E["Light usage typically costs about 1-2 RMB per month"]

Actual costs depend on your chosen service provider, model, audio duration, and usage volume. For light daily input, the cost is typically far lower than subscription-based tools like Typeless.

🚀 Installation & Usage

Open Releases.
Download the latest Windows installer.
Install and launch Whisper Input.
Go to "Settings - Model Settings".
Select the Qwen or Doubao option and enter the corresponding API key.
Press the global shortcut key and start speaking.

🧱 Product Boundaries

Whisper Input intentionally does NOT do the following:

Does NOT Do	Reason
❌ Register as a Windows system IME	Stays lightweight, does not take over the system IME
❌ Meeting transcription tool	Focused on short-to-medium text input, not a meeting documentation platform
❌ Chatbot	Does not generate information the user has not spoken
❌ RAG / Agent	Maintains its position as an input tool
❌ Offline ASR-first	Current focus is cloud-first, prioritizing real-world usability

🙏 Acknowledgements: OpenLess

Whisper Input is built upon OpenLess.

Thanks to the OpenLess authors and contributors for laying the foundation in desktop voice input, global shortcuts, recording state management, text insertion, and Tauri application infrastructure. Building on this foundation, Whisper Input pivots to a Windows cloud-first approach, focusing more on Chinese professional voice input, formal expression, Chinese-to-English translation, and low-cost API usage.

⭐ Star

If you find this project helpful, please consider giving it a star on GitHub to support continued development.

License

Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.github/workflows		.github/workflows
docs/superpowers		docs/superpowers
public		public
scripts		scripts
src-tauri		src-tauri
src		src
.agent.md		.agent.md
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisper Input / Qingyu — OpenLess-based Windows Voice Typing

🎯 At a Glance

🧭 How It Works

✨ Core Capabilities

🧩 Four Output Styles

Original Dictation

📝 Original

🧹 Light Polish

🧱 Clear Structure

🧑‍💼 Formal

🧑‍💼 Formal Expression Example: Meeting Invitation

🌐 Speak Chinese, Output English

🔐 Data & Privacy

⚙️ Recommended Configuration

💰 Why It's Low Cost

🚀 Installation & Usage

🧱 Product Boundaries

🙏 Acknowledgements: OpenLess

⭐ Star

License

About

Uh oh!

Releases 26

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Whisper Input / Qingyu — OpenLess-based Windows Voice Typing

🎯 At a Glance

🧭 How It Works

✨ Core Capabilities

🧩 Four Output Styles

Original Dictation

📝 Original

🧹 Light Polish

🧱 Clear Structure

🧑‍💼 Formal

🧑‍💼 Formal Expression Example: Meeting Invitation

🌐 Speak Chinese, Output English

🔐 Data & Privacy

⚙️ Recommended Configuration

💰 Why It's Low Cost

🚀 Installation & Usage

🧱 Product Boundaries

🙏 Acknowledgements: OpenLess

⭐ Star

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 26

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages