Question 1

Does whisrs work on Wayland?

Accepted Answer

Yes. whisrs has native support for Wayland compositors including Hyprland, Sway, Niri, GNOME, and KDE. It uses compositor-specific protocols for window tracking and uinput for keyboard injection, so it does not depend on X11 tools like xdotool.

Question 2

Can whisrs work offline?

Accepted Answer

Yes. The local whisper.cpp backend runs transcription entirely on your machine. No API key, no internet connection, and no audio data leaves your device. The base.en model runs in under 500 MB of RAM.

Question 3

What speech recognition backends does whisrs support?

Accepted Answer

Seven backends: Groq (cloud, free tier), Deepgram REST and Deepgram Streaming (cloud, $200 free credit), OpenAI REST (cloud), OpenAI Realtime (cloud, true streaming over WebSocket), local whisper.cpp (offline, CPU/GPU), and a generic ASR sidecar that lets you bring your own local model (Moonshine, NVIDIA Parakeet, Microsoft VibeVoice-ASR, etc.).

Question 4

Can I use my own local ASR model with whisrs?

Accepted Answer

Yes. The generic ASR sidecar backend talks to a small local HTTP service that hosts the model. Ready-to-run sidecars are bundled in contrib/asr-sidecars/ for Moonshine, NVIDIA Parakeet, and Microsoft VibeVoice-ASR.

Question 5

Is whisrs a replacement for Wispr Flow or Superwhisper on Linux?

Accepted Answer

Yes. Wispr Flow ships on macOS and Windows but not Linux; Superwhisper is macOS only. whisrs is the open source (MIT) Linux-native equivalent: press a hotkey, speak, and text appears at your cursor. It supports both cloud and fully offline transcription backends.

Question 6

What Linux distributions does whisrs support?

Accepted Answer

whisrs works on any Linux distribution with the required system dependencies. Daily-driven on Arch Linux, with community-confirmed reports on Ubuntu 24.04 (GNOME and Xorg) and CachyOS (Niri). Install methods include AUR, cargo, Nix flake, pre-built x86_64 binary tarballs, and a universal install script.

Backend	Type	Streaming	Cost
Groq	Cloud	Batch	Free tier available
Deepgram Streaming	Cloud (WebSocket)	True streaming	$200 free credit
Deepgram REST	Cloud	Batch	$200 free credit
OpenAI Realtime	Cloud (WebSocket)	True streaming	Paid
OpenAI REST	Cloud	Batch	Paid
Local whisper.cpp	Local (CPU/GPU)	Sliding window	Free
ASR sidecar	Local (HTTP)	Batch	Free (BYO model)

Feature	whisrs	nerd-dictation	Speech Note	Wispr Flow
Platform	Linux	Linux	Linux	macOS, Windows (no Linux)
Wayland support	Yes (native)	Partial (xdotool)	Yes (GUI app)	N/A
Offline transcription	Yes (whisper.cpp + sidecar)	Yes (Vosk)	Yes (multiple)	No
Cloud transcription	Groq, Deepgram, OpenAI	No	No	Proprietary
True streaming	Yes (OpenAI Realtime, Deepgram)	No	No	Yes
Keyboard injection	uinput + XKB (layout-aware)	xdotool	Clipboard paste	Native
Window tracking	Yes (6 compositors)	No	No	Native
Architecture	Daemon + CLI	Script	GUI app	GUI app
License	MIT (open source)	GPL (open source)	MPL (open source)	Closed source

Component	Support
Hyprland	Tested by maintainer and community (Arch Linux)
Sway / i3	Implemented; additional reports welcome
Niri	Tested by contributor on Niri 26.04 (CachyOS)
X11 (any WM)	Tested by community on Ubuntu 24.04 (Xorg)
GNOME Wayland	Tested on Ubuntu 24.04 and Arch (overlay via bundled GNOME Shell extension)
KDE Wayland	Implemented via D-Bus; reports welcome
Audio	PipeWire, PulseAudio, ALSA (auto-detected via cpal)
Distros	Confirmed on Arch Linux and Ubuntu 24.04; any Linux with system dependencies

Voice-to-text dictation for Linux

Quick Install

Features

7 Backends

Works Everywhere

Layout-Aware Typing

Fully Offline

True Streaming

Tray + Overlay

Command Mode

Daemon + CLI

Transcription Backends

How whisrs Compares

Supported Environments

FAQ