13.22.0

Improvements:

Huge performance improvements when scrolling the Transcript screen whilst playing back audio.

Clearer warning when your microphone is unavailable because the laptop lid is closed (clamshell mode).

Other performance improvements.

Bugfixes:

Fixes a crash when recording with mics that use interleaved audio (e.g. Blue Yeti).

Fixed recordings getting stuck and impossible to stop after closing the laptop lid. With the default ScreenCaptureKit backend, the recording now ends cleanly when the lid is closed and can still be transcribed. With the CoreAudio backend, recording resumes when you reopen the laptop (no audio can be recorded when the lid is closed).

Handle multiple taps on the Stop Recording button correctly.

Restored the Live Captions button in the Home screen shortcuts.

Fixed duplicate formats appearing in Batch Export.

13.21.4

Bugfixes:

Fixed a crash that could occur when quickly dismissing a discard dialog after a voice memo recording

Changed to ensure that if speaker identification ever stalls, it now allows the transcription to complete instead of waiting indefinitely

Fixed the Add Format button in Custom Export Formats settings

Fixed a transcription's last-opened date not always updating when you opened it

13.21.3

Bugfixes:

Fixed speaker recognition getting stuck at 99% on macOS 14 (Sonoma). This is a special release for macOS 14 users that reverts an internal change responsible for the issue.

13.21.2

Improvements:

Right-click export from the History screen now uses your most recently used export options instead of the format default

Bugfixes:

Fixed speaker identification hanging on macOS 14 (Sonoma)

Fixed Markdown not always rendering correctly in AI prompt responses and summary previews

13.21.1

New:

Custom webhook headers — define custom HTTP headers (with secure values) for your custom webhook integration

Improvements:

Added a "Setup AI Service" shortcut to the Dictation AI Service menu for quick access to AI Provider Settings

Bugfixes:

Fixed a rare issue where the app could fail to start after upgrading from an older version

13.21

New:

Record Meeting from MenuBar — quick links to start a meeting recording in your favourite meeting apps (Zoom, Teams, etc.) right from the menu bar

Send to integrations from transcript sidebar — integration buttons now appear in the transcript sidebar as well

Custom Export Overhaul — Auto Export now supports Custom Export Formats; PDF works as a Custom Export Format; Export settings split into "Export Formats" and "Batch Export" sections; new batch starts when dragging files onto Home

Improvements:

Subtitle font size setting in the video player

Auto cleanup for Pro users — transcription cleanup runs automatically when needed

Obsidian integration footer clarifies that the Obsidian plugin is required

Bugfixes:

Double-clicking a word now starts playback from that word — previously fell back to the start of the segment

< / > playback speed shortcuts work after jumping to a segment

Auto Export no longer fires when opening a .whisper file — only triggers for newly-completed transcriptions

Apple Foundation Model can no longer be deleted from AI Services

Dictation history inspector UX fixes — overlay close animation, selected-count behaviour, other small quirks

Active meeting time in the sidebar can no longer go negative

Meeting recording reliability — fixes for recordings stopping unexpectedly

Removes the Documents folder permission prompt on macOS Tahoe at first launch

HTML / PDF export now escapes <, >, & — fixes mangled output for transcripts containing these characters

CSV export no longer injects raw newlines inside quoted cells — Excel and split-parsers no longer choke

TXT / Markdown render speakers and timestamps consistently when grouping by people

"Favorited Segments Only" filter now respected by all export formats — previously CSV/HTML/JSON/Dote ignored it

Export screen no longer stuck on a spinner when there are no favourite segments

13.20

New:

You can now control MacWhisper from the CLI! Hook it up in your agent or scripting workflows. You can install it from settings > Advanced. View the documentation to see what you can do with it! https://macwhisper.helpscoutdocs.com/article/57-macwhisper-command-line-tool

13.19.2

Improvements:

Improvements to audio muting during dictation when using bluetooth headphones

Improvements to the design of the toolbar on the queue view

Improved error handling when setting up AI services

You can now rename active meetings and recordings from the sidebar by right clicking

Users on an Intel Mac no longer see model categories that are M-series only

13.19.1

Improvements:

Design tweaks

Clearer permission flow messages

Bugfixes:

Fixed an issue when using ElevenLabs where Chinese transcriptions would have an extra space between characters

13.19

New:

Added support for API keys when using the LM Studio AI service

Improvements:

Performance and stability improvements

Improved system audio recording permission flows

Adjust the default model loading and unloading behaviour for new users. By default models will not be loaded on first launch and will only be loaded in when starting a transcription or dictation. Models are automatically unloaded after 20 minutes so the app uses less memory when not in use. You can adjust these defaults in settings > advanced.

Meeting transcriptions work correctly if no audio is recorded in one of the sources

Improved naming of meetings in the queue, including app icon for the recorded app

Bugfixes:

Fixed an issue where a meeting could not be recorded if permission was not granted correctly or if it was revoked

13.18.1

Improvements:

Meetings are now shown with the correct name in the transcription queue

Improved text size and spacing across the app for textviews

Improved the start meeting flow from the home shortcuts section

Bugfixes:

No longer shows incorrect banners related to screen recording permissions

Toggling playback with the spacebar no longer triggers when also holding the Command button

13.18

Improvements:

Improved Teams meeting detection

More friendly errors when using Apple's Foundation Model AI service

Added delete button when selecting multiple segments at once

Improved rendering of AI chat output

Dictations that are in the queue now have a nicer looking name

When hovering over dictations in the history view the text no longer jumps due to action buttons appearing

If there are only failed or cancelled transcriptions in the queue the app won't present an alert anymore when closing the app

Add visual selected state when selecting items on the homescreen

Bugfixes:

Fixed an issue where the back button could disappear in settings

Fixed an issue where a meeting recording could not be stopped

Fixed an issue where summaries could sometimes be generated in a different language than intended

Fixed an issue where adjusting the number of speakers to detect could create duplicate speakers

Fixed an issue where available cloud providers could not show up in settings

Fixed a localisation issue where some users could see the app translated into German when their Mac was set to French. If you want to adjust what language the app is presented in, go to your Mac system settings, General and then Language and change it at the bottom

Fixed an issue where you could not deselect items by clicking on the background on certain screens

Fixed an issue where temporary files were not cleaned up properly in some cases

13.17

Improvements:

Improvements to how the punctuation setting can be used during dictations

Improved echo cancellation, enable it from Settings > Advanced

Live captions can now be added as a shortcut on the homescreen

Dictation errors are now displayed in your dictation history

Improved permission flow for recording meetings and app audio

Added support for the latest OpenAI models

When multiple history items are selected you can now export them as you would a single file

Improved the design and UX for handling multiple selected history items at once

Small design tweaks to shadows

Bugfixes:

Fixed an issue where when recording in some Chrome instances, no audio would get recorded

Fixed a crash when deleting the last custom export format

Obsidian integration folder names can now have spaces

Search state correctly resets now when switching between pages

When transcribing meetings and app audio recordings, auto export no longer exports individual transcripts but waits for the final transcript to be completed

Watch folder feature correctly observes multiple files across more layers

13.16

New:

You can now choose to not import the media file when transcribing a file. Note that this will not allow you to playback audio. Files you add to MacWhisper are cloned which means they wont take up double the storage space if the original file was on the same hard drive.

Beta release of echo cancellation for app audio recordings. If you are transcribing a meeting without wearing headphones you can try the new echo cancellation option from Advanced settings to not get duplicated transcriptios.

Improvements:

Added support for the latest AI models from OpenAI and Anthropic

Major scroll performance improvements on the history screen for users with more than 1.000 transcriptions

Added a button in the Help menu to give feedback on the translations of the app itself

Added a toggle to save watch folder transcriptions to the history as well

Meeting shortcuts are automatically added for meeting apps you observe

Bugfixes:

Fixed some text issues related to localising the app

Fixed an issue where transcriptions could not be selected and deselected in some scenarios

13.15

New:

German localisation. Die App ist jetzt vollständig ins Deutsche lokalisiert. 🇩🇪

Custom Export Formats. You can now save your favorite export formats and name and quickly access them from across the app.

Improvements:

Local AI services can now work when not connected to the internet

Improvements throughout the app to prevent memory leaks

Faster loading of history items when switching between categories

Improved file reading performance when opening files from other devices on your network

Bugfixes:

Spacebar can be used again to toggle playback when a transcript is opened in a new window

Fixed an issue where Scribe V2 would not be used in some cases

Fixed an issue when YouTube videos fail to transcribe

Progress reporting during meeting transcriptions is more stable

Fixed an issue where the Notion integration could fail on long transcripts

Fix for the dock icon lingering after closing the main window in menubar only mode

13.14

New:

Added support for exporting subtitles to Avid Composer .txt format

Improvements:

Toggle if speaker names show up in the subtitles in the video player

Splitting segments in the middle now properly sets the timestamps

Added new display options for the dictation indicator (center top, center bottom, textfield location or hidden)

Added option to hide subtitles completely in built in video player

Bugfixes:

Fixes to YouTube downloads

Hitting spacebar will not toggle playback when a transcript is not visible

13.13

New:

New improved history screen design which makes it easier to find the transcriptions and dictations you are looking for

Export word level timestamps by selecting "Words" in the grouping picker

View recently deleted transcripts and easily restore them

Integrations can now be triggered from Watch folder files as well

Improved scrolling and searching performance

Improvements:

Click a dictation in your history to view all information about it

Improved YouTube download stability

Transcriptions are loaded 10% faster

Prevents multiple instances of MacWhisper running at the same time

Easily select multiple transcriptions and export or delete them in bulk

Change language on the fly from the live captions window

Improved translation flow during live captions

Improved DeepL implementation to avoid changes to their API

App icons show up correctly for dictations in apps with spaces in their name

You can now rename the name of the app that was recorded in App audio

Add support for Opus 4.6

Delete selected items with keyboard shortcuts

Bugfixes:

Fixed an issue where ignored segments could not be saved in some cases

Fixed an issue in the dictation onboarding

When a file in history is on another computer that can not be reached, the app will not hang for an extended period of time

13.12.2

Improvements:

Add support for automatic meeting detection in Vivaldi Browser

Bugfixes:

Fixed a bug where items on the transcription queue could get stuck and keep restarting

13.12.1

Improvements:

Updated DeepL API implementation to support the latest version

Improved the dictation onboarding flow

Improvements to live captions

Bugfixes:

Fixed an issue where the microphone could not be changed in Global model

13.12

New:

Live translation in live captions. Automatically translate your captions as they are generated. This can be used at international conferences to give live subtitles or for language learning. This currently only supports Apple's translation service.

Use System or App Audio as a source for live captions. Generate live subtitles for any app on your Mac.

Improvements:

More themes for live captions, choose Dark, Light or System.

Stability and performance improvements

Improved dictation support in the WPS Office app

The timestamp adjuster in the segments view now allows editing milliseconds in increments of 0.1.

Muting audio during dictation now works with more external microphones

Doc export now uses the correct font

Bugfixes:

Live captions no longer show a ghost of previous live captions

13.11.1

Bugfixes:

Live Captions: Fixed an issue making results appear more reliably and immediately.

Live Captions: Fixed an issue where dismissing Live Captioning with the Escape key would prevent it from being started again.

13.11

New:

Realtime Captions are finally here! Enable them from the homescreen, menubar or a keyboard shortcut to instantly show captions straight from the microphone. Great for use during conferences or when you want live subtitles. Send us feedback on how we can improve this for your workflows! (Pro)

Added support for multiple language transcriptions with the Deepgram API. This will allow you to transcribe an audio file where multiple languages are being spoken.

Improvements:

When opening an audio file that has previously been transcribed you can now choose to not re-transcribe it again and just open the existing transcription instead

30+ improvements all across the app for stability and performance

You can open transcriptions made in Global mode again in the main app

Bugfixes:

Fixed an issue where the transcript could be sent twice when summarising, thus leading to more tokens being used

Fixed a crash that could happen in Global mode

13.10.4

Bugfixes:

Fixed an issue whilst using the Anthropic API.

13.10.3

Bugfixes:

Fixed an issue where the Global feature could incorrectly display the Upgrade to Pro button.

13.10.1

Bugfixes:

Fixed an issue certain pro-only models could not be loaded even with an active license.

13.10

New:

Notion integration support. Automatically send a finished transcript to a Notion page.

Added the option to also auto export the summary as a text file

Improvements:

Free users can now use the large v3 turbo model as well. Exports are not possible when using pro models however.

Improved error handling for users with vpn or firewalls when validating license keys.

macOS 14 is now supported for the new 13.0 update

Added support for GPT 5.2 models

Bugfixes:

Fixed an issue where a random UUID was briefly shown sometimes as the title of a transcription

Fixed an issue where you could not select a language for Deepgram Nova 3 models.

13.9

Improvements:

Improved the performance of the downloader

When dragging multiple files onto the app you can now choose to batch export them, or to transcribe them directly into the app

Added multiple file title formats for use with integrations. Add the date and time to the file name for example.

Added more places to rename transcriptions and meetings across the app

Added a prompt before closing the app if an active recording or transcription is in progress

Copying a transcript no longer adds an extra newline under the speaker's name

Bugfixes:

Fixed an issue where the old transcript could be displayed briefly when retranscribing

No longer show an empty parakeet models screen for free users

13.8

New:

Added support for Yandex Telemost for automatic meeting detection

Improvements:

You can now choose which folder is used for the Obsidian integration

Added a test button for the Obsidian integration

You can clear a completed voice memo from the sidebar

Bugfixes:

Media files that are not linked to database entries are automatically cleared

The transcription progress circle now updates correctly when retranscribing

13.7.1

New:

Added support for integration with Obsidian. Automatically (or manually) send transcripts to your Obsidian vault

Added a toggle for automatically sending transcripts to integrations

Manually send transcripts to integrations by right clicking on history, from the quick export menu or from the export view

You can now export directly from the history screen

Improvements:

Greatly improved performance when transcribing more than 10 files at once

Show segment highlighting during playback for models that don't support word level timestamps

Retranscribing transcripts will now separate speakers if that was set in the original transcription

Improved navigation flow when filtering by speakers in the sidebar

Added a background to the progressbar for easier viewing on macOS 15

You can configure watch folder transcript formats to separate by speakers again

Bugfixes:

Fixed an issue where renaming a speaker could fail

Removed a flicker that could happen when a transcription finished on the homescreen

Fixed an issue where an AI chat prompt could not be modified

13.6

New:

Integrations! You can now automatically send finished transcriptions to Make.com, n8n, Zapier or a custom webhook. This current release can automatically send the title and raw transcript to these services. Please let us know what else you want to configure. We're also working on adding support for Obsidian and Craft, if you have other suggestions please email us. (Pro)

Improvements:

Added support for the new Gemini 3.0 models

Added support for new Anthropic models

Improved reliability of dictation with AI prompts so that it should not answer your dictations any more. If it still happens let us know on support@macwhisper.com.

Exports are automatically grouped by speaker if a transcript contains more than one speaker. You can still change grouping on the export page.

You can now quickly change AI providers on the dictation prompt test view

Added extra guidance to the dictation settings if you chose the Fn key without setting it up correctly in macOS settings

The display name for imported .whisper files has been improved

If you manually renamed a transcript its name will be shown instead of AI generated titles

Improved support for dictating into WeChat

Bugfixes:

Fixed an issue where license data could not be determined in certain edge cases. If you still get an error saying your Argmax Pro license is not valid, please reach out.

Removed an annoying beep when going through search results

Fixed an issue when using files that were too large for the OpenAI cloud transcription api

13.5

Improvements:

When searching through a transcript, moving to the next occurrence will scroll the transcript as well

Added support for OpenAI GPT-5.1 models

Added filter icon back for the segment view

The initial group of recognised speaker names is now sorted correctly

Bugfixes:

Removed the Scribe v2 realtime model (will come back later)

13.4

New:

Added support for the new ElevenLabs Scribe V2 and Scribe V2 Realtime cloud models

Added back the option to automatically export the audio recordings for voice memos, meetings and app audio recordings. Enable this from Settings > Auto Export

Improvements:

Improvements to meeting detection for Teams and Zoom

New subtler sound effects for dictation

Added a button to transcribe voice memos and app audio recordings at a later time (still saving them to the database for easy access)

Added the option to toggle the app visibility mode through the menubar

Added a language picker for summaries to guide the AI service to generate the summary in your preferred language

Bugfixes:

If you cancel a toggled dictation by pressing Escape you can now start a new dictation by hitting the dictation key just once

13.3

Improvements:

Opening settings from the menubar is now more reliable

Active meetings no longer stay visible in the overlay if the meeting recording is started from the notification

Decreased the padding in the sidebar on macOS Sequoia to give options more space

Clearer error messages

Improved the search and find and replace feature in the transcript view

Added support for .qta (Quicktime Audio) files

Added support for GPT-5 models

Bugfixes:

Fixed an issue where Zoom or Teams meetings could not be started in certain scenarios from the shortcuts section

13.2.1

Improvements:

You can now enable speaker detection when recording app audio or system audio. Toggle it on from the sidebar before you start recording

Bugfixes:

Fixed an issue where dictation could stall for some users on macOS 26.1 or the 26.2 developer beta. Thanks for helping us figure out the cause Patrick, Antonio, Remco, Martin, Wilco and Marc!

13.2

New:

Your dictation history now also shows the app that was dictated into (new dictations only)

Improvements:

When using OpenRouter, requests from the app will now be properly named in your OpenRouter dashboard

Improved search and scroll performance in list view

Re-added support for m4b files that are not protected by DRM

Split segments can now be undone

Clearer alerts when AI providers are not setup correctly

Failed transcriptions close automatically

Summaries that are being generated can be cancelled

YouTube downloads now properly show transcription progress

Dictating with a Parakeet model is now 30% faster

Improved the speaker renaming, editing flow

Meeting names that are taken from the calendar events are more consistent now and will always use the name of the upcoming meeting. If you run into edge cases here let us know.

Added the option to show names instead of icons in the toolbar

Design tweaks to the people section in the sidebar

You can now discard a recorded voice memo

History now shows the date related to your sorting method (last opened, created, last modified)

The audio duration is now shown in history

Bugfixes:

Apple Speech transcriptions can be properly cancelled now

Fixed an issue where dictation could fail on first run of the app

Fixed an issue where renaming speakers could create a new speaker with the same name in an edge case

13.1

New:

Added the option to show your meeting history in the sidebar as well for quick access. It can be enabled from settings > customisation

You can now combine people and their transcripts. Right click the person you want to combine into another person from people settings or the left sidebar

You can now directly export a transcript from the home screen by right clicking, saving you a step from having to open it first

Improvements:

Added quick toggles in the toolbar to switch between list and grid view

Allow renaming transcriptions in list view

You can now show up to 50 items on the home screen history view

Added the option to hide apps from the app audio list

You can now hide the people list in the sidebar from settings > customisation

Toggling off speaker icons also hides them from the left sidebar

Duplicated speakers that are all named Microphone will be automatically merged the first time you run this version

Bugfixes:

Added the option to never show ai summaries in the history view

13.0.5

Improvements:

Opening a .whisper file from Finder now opens it immediately instead of only adding it to your history

Bugfixes:

When you're recording a meeting, notifications about that app no longer show up

Fixed an issue where the transcript would open when you were renaming it and then using a space in the name

Prevent the summary feature from running multiple times at once

The voice memo screen resets properly when transcribing

13.0.4

New:

Added back the option to automatically export a .whisper file when creating a transcription. You can enable this from settings > auto export

Improvements:

Fixed an issue where speaker recognition was not working for users that had "automatically convert segments to sentences" enabled in Advanced Settings. Thanks Jody, Christian and Steven for helping us figure this one out!

13.0.3

New:

Improved speaker recognition with the Pyannote 4 diarization model

Added support for Hugging Face Inference Providers to use as an AI Provider

Improvements:

Added support for the Comet browser

Improved dictation output with Foundation Models to not have >

You can now choose which speaker is used for your microphone in meetings, instead of choosing just the text (which would create a new speaker each time)

Bugfixes:

Fixed an issue where the full summary generator would use the model that was selected for short AI summaries

Duplicated speakers that occurred after upgrading to 13.0 should be combined again automatically when launching this version

Batch Export and Watch Folder export properly shows speakers

The summary feature now uses the selected AI service instead of the one that was set for short summaries and AI titles

13.0.2

Improvements:

Audio playback now only starts when double clicking on a part of the transcript instead of a single click

Improved dark mode support

Bugfixes:

Fixed an issue where the AI screen could be shown on screens other than the AI screen

13.0.1

Improvements:

Items in the queue that belong to the same transcription task (such as meetings or batch) are now grouped

Custom Gemini models now work better even when no safetyRating is passed

Improved support for dictation in the ChatGPT Atlas browser

Bugfixes:

Exporting to PDF works again

When importing your history, speakers with the same first and last name are now grouped

13.0

Thanks for your patience while we worked on this major new update. Please let us know if you run into anything we can fix or improve! Email us at support@macwhisper.com, thanks!

New

Completely reimagined design that embraces Liquid Glass

View your full transcription history with full text search and text previews in a list or grid

Customize your Home Screen with shortcuts, including to record meetings in your favorite apps

Open multiple transcriptions at the same time in the main window and easily switch between them from the sidebar

Open a transcription in a new window by right clicking the history item

New dedicated Summary and AI chat tabs for easier access

Simplified transcription sidebar showing only the most relevant information for the active display mode (transcript, summary, ai)

Full color theme mode options (beta)

Transcribe multiple files one after another with the new transcription queue. Transcribe across different models and providers without having to stop ongoing transcription jobs.

Transcription Improvements

Improved transcription performance across all models by ~10% due to improvements in macOS Tahoe.

Added support for transcribing with the new improved Apple Speech models which are very fast and probably already downloaded on your device

You can now retranscribe System Audio/ Recorded Meetings/ Podcasts as well, preserving the multiple tracks.

Added support for the new OpenAI 4o-diarize cloud model

Speaker Improvements

You can now add photos for speakers

You can now search and filter transcripts by speakers

Your most used speakers are visible from the sidebar and can easily be filtered on

Dictation Improvements

You can now dictate while an active transcription is in progress

You can now use a different model for dictation and transcription. More options for this are coming in subsequent releases.

You can now test your dictation prompts in dictation settings to see how your enabled AI service will perform.

View, playback and copy your dictation history from the main app

Added support for dictation in the ChatGPT Atlas browser

AI Services

You can now use an AI service to automatically generate titles for your transcripts and to create short ai summaries to show in your history

You can choose which (local) AI provider to use for automatic short summaries and title generation to ensure your transcript data stays private. You can also choose to use a a non-local model if you're comfortable with your transcription leaving your device.

Added support for Apple Foundation Model on macOS 26 which lets you summarize and chat with your transcripts without your transcription data leaving your device.

Meetings

Meetings are automatically transcribed after they're finished

You can record a new meeting while the previous one is transcribing in the background

Quickly start a new meeting recording by adding a home shortcut for your desired meeting app

Added support for meetings in the ChatGPT Atlas browser

General Improvements

Translations can now also be shown with speaker grouping

You can navigate to the rest of the app while recording a new voice memo or app audio recording

Models can be changed without having to wait for them to finish loading

Sort your history by last opened, last edited or creation date

Added more font size options

Watched Folders now run in the queue without seeking confirmation.

Added support for the latest Anthropic models

When adding a new speaker to a segment it's automatically applied to the segment you had selected

Bugfixes

YouTube transcriptions work again.

Fixed an issue where turning automatic meeting recording off could spike CPU usage

12.18.3

Bugfixes:

Fixes an error alert that could occur when downloading certain local models

12.18.2

Bugfixes:

Fixed an issue where some models could not be downloaded

12.18.1

Bugfixes:

Fixed YouTube downloading

12.18

New:

Added support for Parakeet v3!
Parakeet v3 supports multilingual transcribing in 25 languages, in the same transcription or dictation!
- Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Russian, Ukrainian
Added support for new OpenAI open source models

Improvements:

Added support for dictation in Notebooks

Bugfixes:

Fixed opening Settings from the menu bar

12.17

Improvements:

Improved dictation support in Ghostty
The toggle to recognise speakers for microphone recordings is persisted across recordings now
Clearer error messages if a model download fails
Added a toggle to choose the formality for DeepL translations
Find and replace clears the find textfield if all occurences are replaced
Added support for the latest OpenAI models

Bugfixes:

Fixed an issue where you could not open previously made whisper files as a non-pro user, even if they model used did not require Pro
JSON exports now have proper .json file extension

12.16.1

Improvements:

Speakers are identified better in VTT exports.

Fixes:

Parakeet models: short recordings could sometimes lead to empty results
Parakeet models: some words were being truncated
Parakeet models: model might not finish loading after first-run.
Issue where speaker recognition might not complete on transcriptions that were run immediately after launch
Workaround for dictations that weren't working in Ghostty
Fix visibility of retry button for model downloads if they failed

12.16

Improvements:

Speaker recognition is up to 60% more accurate now
Added support for Azure models that require temperature to be set to 1.0

12.15.1

New:

Added support for the Grok 4 model
Added a Hebrew language specific Whisper C++ model

Improvements:

You can now switch between found search results with button in the search bar
Added model picker for xAI models
Design tweaks for improved readability

Bugfixes:

Fixed an issue where setup ai services would not show for some users
Improved accuracy of parakeet models to prevent words sometimes getting cut

12.14

New:

Added the option to setup AI services through the MDM plist

Improvements:

Added the option to show timestamps in full transcript exports
All the advanced models are available again
Stability improvements

Bugfixes:

Fixed an issue where you could get an alert when a model was still loading, instead of the transcription starting
Hide the calendar button since it's still in development, oops

12.13

Improvements:

The app gives a clearer message about needing to update to macOS 14.4 or newer to use the Parakeet models
Improved stability of the dictation feature
Added an extra warning to prevent data loss by not transcribing app audio recordings
The dictation overlay is presented above third party spotlight-like overlays
Dictation works again on www.chatgpt.com
Design tweaks
Font size can be changed with Cmd and the -/+ buttons instead of just the -/+ buttons
WhisperKit models are presented for m1 users again

Bugfixes:

Fixed an issue where the app could say "Whisper is busy"
Fixed an issue where batch transcriptions for short audio files with Parakeet models could fail
When modifying a custom dictation AI prompt it's enabled without having to toggle it first

12.12.2

New:

Added support for the new Parakeet model, which is extremely fast and accurate! It can transcribe at up to 300 times real speed on the latest Macs. This current release only supports English, please try it out and let us know what you think! (Pro)

Improvements:

The progress indicator now shows progress more accurately and calculates the time remaining more often
Design cleanup around the app
More localization of error states
Added support to meeting detection for Zen and Dia Browser

Bugfixes:

Fixed an issue where recordings could fail on Macs that have their language set to Portuguese due to a file name issue
Fixes for using the Claude 4.0 models

12.11

New:

Batch transcriptions can now be paused and resumed (transcriptions will pause after the active item is finished) (Pro)
Added support for the new Claude 4.0 models (Pro)
You can now drag out .whisper files from the sidebar to other apps on your Mac

Improvements:

Meeting notifications don't require two clicks anymore (one to activate the window, one to click the button)
Pro users no longer briefly see pro badges on launch
Dictation can now also be muted when playing audio through a Studio Display
Improved the new floating video player
Clearer error messages when LM Studio models return errors

Bugfixes:

No longer showing an empty window in Mission Control if you use the menubar only mode
Fixed an issue where the app was not immediately responsive on launch
When pausing with spacebar, the videoplayer now also pauses correctly

12.10

New:

Added a new floating video player mode

12.9.1

New:

You can now select how many speakers are in the transcript and run speaker recognition again for improved results (only when using local WhisperKit models)
Pro models can now be used for dictation for free users as well

Improvements:

Improved the remove dashes enhancement feature to include more occurences of dashes at the end of segments as well as only segments with a dash
Added the option to only export favorited segments to segment or subtitle export
Added more options for MDM deployments

Bugfixes:

Fixed a crash when deleting the last segment on the speaker view
Fixed an issue where meeting recordings could have sped up audio

12.8

New:

Automatically transcribe files in watch folders! Files that are added to your watch folders can now automatically be transcribed into multiple formats. Thanks for the feedback and testing on this!

Improvements:

YouTube downloads are working again!
Added more options for MDM deployments to set or limit certain features
Added back the ability to mute audio while dictating. MacOS 15.4 removed the option to pause audio, so this brings back that functionality in the form of muting audio instead.

Bugfixes:

Fixed an issue where the app window would sometimes not open when clicking the Dock icon
Fixed an issue where dictation would not start unless the main window was opened when using menu bar only mode
Fixed an issue where dictation errors could not be dismissed

12.7

Improvements:

You can now rename files from the sidebar
Fixed an issue in the dictation onboarding
Removed timeouts for ElevenLabs and Deepgram for when you're transcribing long files
Improved support for more .ogg files such as WhatsApp audio messages
Updated the design of the task overlays in the bottom right corner of the main window

Bugfixes:

Fixes for recording meetings in a Chrome PWA
Fixed a visual bug that could appear at 100% when using cloud transcription

12.6

Improvements:

Improved the first run experience for new users
Whitespaces are automatically stripped when adding your own API keys
The history sidebar looks nicer

Bugfixes:

Removed "Pause Audio during Dictation" since macOS 15.4 broke it. We're working on bringing it back.

12.5

New:

Punctuation in dictation. You can now toggle punctuation mode which will automatically add punctuation such as "new line", "question mark" and others.

Improvements:

Improvements to meeting recording
Fixed an issue where YouTube downloads were using the wrong audio track
Speaker recognition now also works in System App Audio recordings and when downloading URLs
When using the ElevenLabs API the file size limit is now set correctly
Improvements to speaker recognition
New lines are stripped out of dictations that use Gemini
Meeting recordings can now be transcribed with cloud providers as well as long as the file is under the file limit for the provider
Added an export option to only export favorited segments
Added support for the new GPT 4.1 models

12.4

Improvements:

You can now rename meetings from the sidebar (right click) or the toolbar
Clearer error codes when a meeting recording can not be started
Added a space after each dictated piece of text
Text is now aligned to the leading side of the view instead of justified
Added an icon for the history bar for system audio recordings
Added the option to show/hide the speaker names in the segment view
You can now export the combined audio from a meeting
Added support for recording meetings in Orion browser
Stability improvements

Bugfixes:

Fixed a potential crash when recording a meeting
Fixed an issue where duplicate files would get saved which would take up a lot of storage when editing a whisper file
Fixes for Deepgram integration
Fixed an issue where error alerts in dictation mode could not be dismissed

12.3

Improvements:

Improvements to the record meeting feature to make it more stable.
App Audio recordings now show the app icon of the app that was recorded
When you change the AI prompt in the dictation popup, that will be the default from that point on
Added a space after a dictated sentence
Improvements to speaker recognition
The onboarding is improved for MDM deployed versions of the app

Bugfixes:

Fixed an issue where meeting recording could crash
Fixed an issue where a microphone would not be recorded in meetings in specific setups

12.2

New:

You can now filter your transcript by unknown speakers, and quickly jump to the next occurence of an unknown speaker
Added the option to use custom cloud transcription providers based on the OpenAI whisper spec. This can be used for running transcription on your own private server endpoints.
Added support for language specific models. Currently Swedish and Japanese, more are coming.

Improvements:

Speaker recognition now also works for meetings and batch transcriptions
Added support for o1 and o3 models for the OpenAI AI service
Added a toggle for microphone recordings to enable speaker recognition in case you're recording multiple people at the same time
Added a retry button for when dictation fails with a cloud provider

12.1.1

New:

Support for more models for Deepgram, including their model specifically trained for the medical field
You can now choose to automatically save each transcript as a .whisper file. This will be the default behaviour soon, but you can already enable it from settings.

Improvements:

You can now undo accidentally deleting a speaker from the sidebar
Improved dictation with AI prompts so that the AI service does not reply to your dictation
Highlighted search words are improved in the transcript view
Added the option to also delete a file when you want to remove it from your history list

Bugfixes:

Fixed an issue where dictation could ask for an AI service even if it was not set up

12.0.1

Small bug fix update after 12.0 to fix a crash when opening the record meeting settings screen

New:

Automatic Speaker Recognition! Finally! Automatically recognise speakers in your recordings using local models. To use it, make sure you select a model that supports speaker recognition (WhisperKit). After your transcription is complete it will automatically be grouped by speaker. We're still working on improvements so let us know what you think! (Pro)
Click on segments in the transcript view to start playback from there
Play the first segment for an identified speaker from the sidebar to make it easy to identify which speaker is who
Added the option to automatically improve spelling and grammar for dictations without having to use a prompt (Pro)

Improvements:

You can now adjust the speaker for a paragraph from the transcript view
Assign segments to a different speaker using the keyboard shortcuts (1,2,3 etc)
You can now use cloud transcription models in the app without having to first download a local model
You can now reassign all segments from one speaker to another one
Speaker recognition now also works for M1 users
Added a badge to identify which models support speaker recognition
Made it clearer when the app is identifying speakers instead of it appearing like progress is stuck at 100%
Small design tweaks and bugfixes
Improved the design for prompts in settings
Speaker recognition is now also enabled for microphone recordings

Bugfixes:

Fixed an issue where extra spaces were added for some languages such as Thai and Chinese

11.13

New:

Added the option to show timestamps for grouped speaker paragraphs in the transcript view

Improvements:

New design for the speakers in the sidebar
Improved grouping of speakers by removing the 'Speaker 0' issue. More improvements are on the way.
Added some more empty state screens

Bugfixes:

Fixed an issue where the font size of the transcript view could not be adjusted

11.12

New:

First release with speaker recognition for local models! Try it out with a WhisperKit models (Pro only and M2 or newer only for now). Please send us your feedback on what we should improve.
New display mode where the transcript is grouped by speaker paragraphs. Only visible if multiple speakers are available.

Improved:

Improved performance while fast loading models

Bugfixes:

We're working on solving an issue on the latest macOS 15.3 in combination with M1 series Macs. WhisperKit models are temporarily not recommended for M1 series users while we figure out a good path forward.

11.11

Improvements:

Transcription complete notification are no longer shown when the app is in focus
Improved the performance of the markdown scroll view on the export page
Fixes a bunch of export preview edge cases
Improved the PDF export design
You can now hit return when searching to move to the next instance

Bugfixes:

If translation to a language fails for whatever reason it's no longer added to the recent list
If a WhisperKit model crashes the app will now turn off fast loading to prevent it from happening again while we wait for Apple to fix this bug in a new version of macOS

11.10.2

Bugfixes:

Deepgram can now be used for dictation mode again
Users who had their WhisperKit settings set to use the GPU are now properly reset to Neural Engine if they run into a macOS related crash
Small bugfix that nobody will notice

11.10

New:

Automatic Speaker Recognition can now be used with Deepgram and ElevenLabs cloud transcription. Local support coming very soon!
Added support for OpenRouter AI services
Added support for Deepgram Nova for cloud transcription
Added support for ElevenLabs Scribe for cloud transcription
Add support for ChatGPT 4.5

Improvements:

Improved the design and spacing in the pdf export
Automatically select the newly created Find and Replace item in settings
Settings loads faster the first time you open it
Adding a new prompt in settings is now faster since the sheet will appear immediately
Added a toggle to disable sending of anonymous telemetry data
New notification when a transcription made with a Whisper C++ causes a lot of repetitions
Fixed an issue where loading a WhisperKit model would crash with Fast loading enabled. If you get the message again, it should not happen afterwards.
Added a button to retry sending your dictation to a cloud provider or AI service if something went wrong
You can now move to the next found search result by hitting return
Improved the design of search highlighting

11.9

New:

Added initial support for Dvorak (and other) keyboards for dictation. Enable it from Settings > Advanced and let us know if you run into anything if you have an 'exotic' keyboard setup

Improvements:

The meeting detected notifications now automatically dismiss after ten seconds
Improved model selection flows in edge cases
Fixed an issue where dictation could crash with specific hardware
Clearer errors when a file can not be transcribed in batch mode
You can now switch between found search text results with ⌘G and ⌘⇧G
Improvements to translating with Apple Translation
Added support for Claude 3.7 Sonnet

Bugfixes:

Fixed an issue where you could not remove items from the batch list

11.8

New:

You can now choose when transcription finished notifications should be triggered based on the duration time to finish the transcript
Added the option to export as JSON (Pro)

Improvements:

Improved textfield detection in dictation (better support for Sublime and other text editors)
Improved the contrast of the start recording button on global in dark mode
Added a correct minimum width for the sidebar in settings
Hitting return during an active dictation no longer finishes the dictation

Bugfixes:

Fixed an issue where when you end a meeting the notification in the app could show Zoom instead of the app you were recording

11.7

New:

Huge speed and stability improvements for the dictation feature. Long dictations now appear almost instantly because your words are transcribed in the background as you're talking. Try it out and let us know what you think!
Added meeting detection for Edge
When scrolling through the playback bar, the transcript segment is highlighted
You can now name your microphone recordings

Improvements:

Improved loading of recorded Teams files when using WhisperKit models
Added support for the latest Gemini models
Improved meeting detection for Skype
Added a clearer alert when trying to use a cloud provider to transcribe a meeting since the file size limits don't allow it currently
You can enable the dictation feature to work everywhere, not just when a text field is detected. To turn it on, go to Settings > Advanced.
Clearer errors when LM Studio returns an error response
When during dictation no textfield is focused, your dictation will be added to your clipboard
Support diacritics in HTML export
You can now remove previously detected, but unused, microphones from the microphone priority list
Fixed an issue where the sidebar would jump when opening settings for the first time (finally!). If you still see it, let us know.

Bugfixes:

Fixed an issue where you could not switch AI prompts easily from the dictation bubble

11.6

New:

Free users can now use all models! See how much the quality improves when using the largest models. You won't be able to copy, export or otherwise use the transcript unless you upgrade to Pro.
Added the option to unload models after a fixed number of minutes. This can be useful for users with lower amounts of RAM. Models are loaded back into memory when starting a new transcription.
You can now choose to pause and automatically resume playing media when using the dictation feature
Active downloads now show in the overlay on the homescreen
Dictation Word Dictionary: Add words and terms that the dictation feature misinterprets. When an AI prompt is active during dictation, these words will be corrected automatically.

Improvements:

When transcribing meetings and other recordings with multiple recordings, the progress bar now shows more clearly how many files are remaining
Fixed the model and language picker designs in the sidebar and other places
Models are now loaded on first transcription instead of on the homescreen
The Assistant sidebar is no longer cropped when the window is very small
Speed improvements all across the app, it should feel a lot snappier!
Improvements to active meeting detection
Speaker colors that get assigned in meeting recordings are more consistent

Bugfixes:

Fixed an issue where the "Manage Models" button could flash on launch
Fixed an issue where the main app could not be opened from the global overlay if the main window was closed before

11.5

New:

Improved support for .oga files
Added support for more automatic meeting recording applications such as Amazon Chime, Skype, Discord, WhatsApp and web browsers
Automatic Meeting Detection now supports a lot more apps. Please let us know if you run into any issues

Improvements:

The summary feature now takes into account the original language of the transcript and will output in that same language by default
You can now choose the default name for you and other attendees in meetings
Fixed an issue where YouTube links could not be transcribed
Improved dictation performance to where it should not answer your dictations if you don't specify it as a custom prompt
Added a toggle to enable or disable automatic summarisation when you open the summary view
You can no dismiss detected meeting overlays
Better error handling in certain scenarios
Improved meeting detection in Zoom and Webex
Remove mentions of BLANK_AUDIO from empty dictations

Bugfixes:

Fixed an issue where you could not cancel a YouTube download

11.4.3

Improvements:

Clearer error messages when rate limited

Bugfixes:

Fixed an issue that could cause the app to hang.
Fixed a crash that could happen when using automatic meeting notifications

11.4

New:

You can now choose to save Global recordings in your history for easy access later (thanks Martin)
You can now easily remove segments that only include words with asterisks surrounding it
Added support for Deepseek as an AI service

Improvements:

Improved the design of the Record System Audio screen. It's now simpler to select which app you want to record and it's clearer when the microphone is recorded as well.
Added a recording time indicator for the Record System Audio feature
The app now remembers if you wanted to record your microphone during app audio recordings
When adding new speakers, their default color should no longer clash with existing speakers if possible
If no model name is selected for a Ollama or LM Studio service the title will show correctly
You can more easily add a new speaker by right clicking a segment, even if there already are speakers available
The speaker percentage view now properly adds up to 100%

11.3.2

Bugfixes:

Fixed an issue where a meeting notification could show the wrong meeting app
Improved detection of Teams meetings
Meeting app notifications are now displayed in middle top of the screen
When adding new speakers the color should be unique
Fixed an issue where the main app could not be opened from the menubar or the menubar would not show up.

11.3

New:

Automatic Meeting Detection! The app will now detect if you are in an active Zoom, Teams or Webex meeting and will notify you to automatically record the meeting. Free while in beta. (Pro)
Start recording a meeting straight from the menubar
The history sidebar is now grouped by days and weeks
You can now use app specific AI prompts with the dictation feature. The app can automatically switch them for you so you can use a different prompt while answering emails or while coding for example. (Pro)

Improvements:

Improved the design of the AI services screen in settings
Improved the design for the speakers section in the transcription sidebar
Speakers are added to the sidebar when recording meetings and podcasts again
The audio files from your microphone and the rest of the meeting participants are now saved together and are accessible from the history sidebar
Active meeting recordings are shown in the active task area in the bottom right of the main app
You can now record multiple meetings and then transcribe them afterwards
Fixed a flicker on the dictation settings screen
Left clicking the menubar now shows the menu, while right clicking opens Global
Added a button to check for updates and to open settings from the menubar
You can add new speakers from a right click on a segment again
Added info per speaker to the information sidebar. You can see words, characters and percentage of words spoken per speaker.
Improved how the app handles when it's only active in the menubar
You can now favorite, delete or assign speakers to a segment that you're hovering over, without having to select it first

Bugfixes:

Speakers are added in the sidebar when transcribing podcasts and system app audio again
Fixed a crash when using VAD with WhisperKit
Fixed an issue where you could not use AI features after opening a Global transcription in the main app
Fixed an issue where audio would not playback for podcasts or meetings until you saved
Fixed a potential crash related to a corrupt wav file

11.2

New:

Added a search bar in settings to more easily find all available options
New design for settings, please send us feedback on what you think!

Improvements:

Added support for the new Gemini 2.0 flash experimental model
The name for downloaded files is prettier
Added the option to show the full transcript on the Assistant tab
When selecting a prompt from the sidebar in Assistant Chat it is no longer automatically sent so you can adjust it
Cleaned up the homescreen a little bit
Added icons in the model picker on the homescreen

Bugfixes:

The Assistant tab now remembers if you last used the Chat or Summarize feature
The release notes message in the bottom left won't show on first launch
HTML previews now load correctly
Fixed an issue where text would blink while transcribing when using WhisperKit with VAD
Various small bug fixes around the app

11.1

New:

The search bar now shows how many matching words were found in your transcript
Added the option to use Voice Activity Detection for WhisperKit models. This will increase your transcription speed and will remove issues related to empty chunks of audio. Try it from Settings > Advanced (Pro)
You can now choose to only show the app in the Dock, the menubar or both
Transcripts created with WhisperKit models will now highlight individual words during playback on the transcripts view (Pro)

Improvements:

The html export now changes background and text colors based on light and dark mode
Dictations will no longer show up in clipboard history managers such as Alfred
Added a button to create a new folder when choosing your save location
The videoplayer playback speed matches the audio playback if you increase or decrease the speed
If a Whisper model can not be loaded on launch, the next best model will be loaded
You can now increase and decrease the font size with ⌘- and ⌘+
You can increase or decrease the playback speed of the player with the < and > buttons on your keyboard
The copy button now takes into account the display mode you have selected
Whisper files show a nicer filename
The export preview text now shows your entire transcript
When translating you will see a "Translating..." indicator
The sidebar now animates nicely when switching between full and compact mode
You can now add timestamps and speaker names when using AI features.
You can now remove translations by right clicking them in the sidebar
Added quick options for combining segments to sentences and for removing "- " at the start of segments

Bugfixes:

You can't click the "Start" button in global mode anymore when no models are loaded
When starting playback by clicking on a segment, the videoplayer will now be sync

11.0.1

Bugfixes:

After exporting, the MacWhisper window doesnt disappear any longer.
Fixed an issue where it might not be possible to setup Dictation using a shortcut that was already configured previously.

11.0

New:

Completely new design for the transcript view, with a convenient sidebar for easy access to the most used features
Adjust the font size on the transcripts and segments views from tiny to very large
Collapse the sidebar for a focused view of your transcript
You can now choose to show padding around your transcript for a cleaner view
Speakers are now added on a per transcript basis and can be added more easily from the sidebar
Added a clearer Pro overview screen in settings
Improved transcript view design with flexible sidebar
Add speakers directly in the transcript view
View information about the current transcription on the new Info tab
You can now choose to use the right option key for dictation

Improvements:

When retranscribing with a different model the homescreen won't flash anymore
All sound effects are now at the appropriate volume (thanks Konstantin)
You can now adjust the colors per speaker from the speaker sidebar section
You can now assign speakers to a segment by using the 1,2,3... keys on your keyboard
Segments now have a background color that matches the speaker associated with that segment
Responses on the AI screen will stay visible when switching display modes
Improved the design of the AI Services view in settings
Faster performance of the preview on the export view
Added a copy button to the toolbar for easier access

Please let us know what you think of the new redesign and if you run into anything that can be improved by emailing us!

10.9.2

Improvements:

Added a "Don't show this again" option to more dialogs. These can be managed via the "Show Save Confirmation" preference in Settings.

Bugfixes:

Fixed an issue where the "Don't show this again" dialog preference might not be saved correctly.

⚠️ Last Ventura Update

This is the last update that supports macOS 13.0 (Ventura). Please update your Mac to 14.0 or higher to use new features we are adding to MacWhisper. If you run into issues on this version on Ventura please let us know and we'll try our best to fix them so that the app is as stable as can be for users on 13.0.

10.9.1

Improvements:

Fixed an issue where Microsoft Teams recordings would sometimes fail to load
Added a clearer "Do not ask again" button to the save transcription overlay
Fixed an issue where dictation would paste clipboard content
Improved support for detecting textfields in apps
Dictation now works with ChatGPT, Anthropic and other overlays
Video player content is now in sync with audio

Bugfixes:

Fixed an issue where release notes would be shown in the wrong scenario
No longer adds a file in the user's documents folder. It can be safely deleted after up

10.8.1

Bugfixes:

Dictation: Fixed missing spaces bug when dictating into some browser fields

10.8

New:

Dictation is up to 10x faster for longer chunks of text
Added a setting to launch MacWhisper at login

Improvements:

Improved the position of the dictation overlay on secondary displays
Improved compatibility of the dictation shortcut keys
The changelog is not shown to users who have not seen the onboarding yet
Defaulted the WhisperKit settings to use the Apple Neural Engine
Fixed an issue where the Global view keyboard shortcuts would stop working after opening it multiple times

Bugfixes:

Fixed a crash when editing the last segment in a file

10.7

New:

Custom keyboard shortcuts for dictation are back! Besides using the Fn or right Cmd key, you can again choose your own keyboard shortcuts to start and stop the dictation feature. Thanks for the feedback!

Improvements:

Fixed an issue where some files could not be loaded with a WhisperKit model.

10.6

New:

Watch Folders: Add folders that you want the app to observe, and whenever a new compatible file is added you can quickly transcribe it. Send us your feedback on how we can make it better for you! (Pro)

Improvements:

You can choose to toggle Dictation instead of having to press and hold the dictation key. You can adjust this from Settings > Dictation.
Updated to use the latest Claude 3.5 Sonnet model
Your old dictation keyboard shortcut gets disabled after enabling the new dictation features
Use the correct keyboard glyps in dictation settings
Added extra options for MDM deployment to disable AI services and Cloud Transcription
Faster performance when using WhisperKit models

10.5

New:

Push to Talk Dictation! We've reworked the dictation experience to be faster and more convenient. Just press and hold one of the dictation buttons you choose, talk, and release to type in any textfield on your Mac. Enable it by clicking the Dictation button on the homescreen.
Full Support for Writing Tools on macOS 15.1. Locally summarize, rewrite and improve your transcripts with Apple Intelligence
Dictation and Global history, view your past 50 dictations and copy them to reuse
Added support for Google Gemini AI models (Pro)

Improvements:

Improved the textfields when adding your own AI prompts

Bugfixes:

Fixed an issue where dictations could take a long time to complete
Fixed an issue where transcriptions couldn't be saved when editing a segment
Fixed an issue where the settings window would disappear when opening the app with it open

10.4

New:

Added support for LM Studio for very fast local AI models
Added support for the xAI API

Improvements:

Removed the num_ctx parameter for Ollama, which should improve performance, let us know if you run into anything.
Tweaked the design of the select model button on the AI page.
The inline video player no longer shows up on the AI screen where it overlaps with the prompt view.
Hide the "Recently Used Languages" section in translation settings if there are no languages yet
Fixed a strange animation in the onboarding
Improved the design of the AI services screen
Show clearer error when you're trying to use the MacBook microphone with the lid closed

Bugfixes:

Fixed "No context length could be determined" error for some Ollama models
Fixed an issue where two of the same microphones could appear in the microphone priority list
Fixed "Finished without any text" glitch when using Cloud transcription

10.3.1

New:

Added support for using Groq as a Cloud transcription provider (Pro)
You can now configure which microphone should be used for recordings. From Settings > Microphone you can choose 'System Default', 'Fixed Microphone' or 'Priority List'.
Added a changelog screen for larger updates to highlight new features

Improvements:

Find & Replace: If case sensitive is turned on, then the replacement should also be case sensitive.
Find & Replace: make regex search pattern safe for special characters such as !, ? etc.
Textfields look nicer in settings
Improved button placement and design in settings
Ollama models now have a higher token limit

Bugfixes:

Fixed minor memory leak when undoing changes

10.2

New:

Default Batch Export settings. You can now setup which formats should be used for batch transcriptions from Settings.

Improvements:

Improved performance of batch transcription screen when transcribing more than 20 files in one go
Design tweaks for the batch transcription view
You can now use a custom OpenAI model as well (for anyone with access to gpt-5)

Bugfixes:

Fixed an issue where the internet connection was checked too often for people whose internet connection dropped out

10.1

New:

Add support for OpenAI hosteda on Azure
The Global feature is now also available to free users

Improvements:

Added a delete button to ai services
Added clearer errors per AI service provider
Clarified what urls are valid for custom endpoints
New icon for custom AI services

Bugfixes:

You can now add multiple Groq services and use different models for each
Fixed an issue where full transcripts where exporting with timestamps
Timestamps in segments exports are now displayed correctly

10.0.1

Improved:

Small fixes and improvements

10.0

New:

Added the new Whisper Turbo model which has the same accuracy as Large, but can transcribe at 20x realtime. Try it out!
Local AI Models with Ollama support. You can now use any AI model that you run through Ollama on your Mac.
Custom AI providers. You can now add your own custom AI providers which use the OpenAI API spec. Add them from the AI Services tab in settings and then use it across the app.
Grog AI support. Use the Grog service to run AI prompts on your transcripts with your own API key.

Improved:

Global and Dictation now also supports removing duplicates as well as your ignore/replace list.
The Global window can't be dismissed while it's transcribing to make sure your recordings don't disappear accidentally.
Cleaned up the model picker dropdown to more easily differentiate model capabilities and engines.
Timestamps can be enabled again in segment exports.
Global timestamp changes now show up correctly in exports.
You can adjust timestamps correctly now on segments, even if they go to the next minute.
The Manage Models screen is more resilient to a lost network connection.
Extra checks to prevent WhisperKit models from showing up on Intel devices (where they are not supported).
Added extra "Copy logs to clipboard" option to help debug issues.
Improved the design for the AI Prompts page.
Prompt titles and prompts now have a nicer sheet with more space for editing.
Added descriptions for the different export options.
Dictation now works correctly in Arc Browser

9.15

Improved:

Fixed an issue where the app could crash on macOS Sequoia
Fixed an issue where quick exports to full transcripts would include timestamps
WhisperKit models are now shown in the manage models list by default as well. Try one of the out from the dropdown in the top right for even more accurate transcripts.
Improved the UX around the Global mode. If you have stay on top enabled but have not started a recording yet the window will still dismiss.
When you dismiss the Global mode while a recording is active, it will still be there when you open Global again
Improved compatibility for more mp4 files

9.13

New:

You can now easily re-transcribe using a different model or input language, directly from the Transcripts screen.
The Segments screen now supports scrolling up and down using the keyboard while the text fields are focused.

Improved:

Added a "You must first fund your OpenAI account to use this API key" message to the OpenAI Settings screen, if we detect the account has run out of credit.
Migrated WhisperKit (beta) models out of the Documents folder. If model files are offloaded to iCloud and not available locally, the WhisperKit models will need to be re-downloaded in the Manage Models screen. Please note that the first load of these migrated models may take some time as they are optimized for your specific system configuration.
Resolved an issue where OpenAI Cloud Transcriptions would not stop correctly if cancelled midway through the process.
Fixed an issue where YouTube downloads would continue running after the download was cancelled.

9.12

Improved:

More reliable "typing" output support for the Dictation feature, with fixes for specific apps such as rich text fields in Chrome and Firefox. Please email us at support@macwhisper.com if you still run into issues!
File History list is now more resilient at opening files that have been moved.

9.11

New:

Support for dictation recordings under 1 second has been added.

Improved:

Typing dictation output is now delayed until all keyboard shortcut modifier keys are released.
Enhanced error handling when dictation does not have microphone permissions.

9.10

New:

You can now also use Anthropic as the dictation AI service.
You can choose which model to use for dictation (in Settings), separately to the model you use on the AI Transcription screen.

Improved:

When modifying a dictation prompt, the matching active prompt will now also pick up the change.
Improved accuracy of dictation typing, and squashed a few bugs (for example: typing times into Apple Mail now works correctly).
The main MacWhisper window no longer appears when dictation is activated.
The default Dictation AI Prompts have been improved to deliver better responses from the LLM.
WhisperKit models stream responses into the Transcript view. This release fixes a UI glitch.

9.9

Improved:

Dictation: Fixed an issue where dictations could have spaces in wrong places
Dictation: Fixed an issue where two spaces could turn into a period
Global: Fixed an issue where the audio could not be saved when opened in the main app
ChatGPT: Added support for "GPT-4o Latest" and "GPT-4o 64k Output Alpha" (this last one requires you have access to it before you can use it)

9.8

Improved:

Fixed an issue with the YouTube downloader not working correctly
When you have the 'Translate to English' feature enabled it will now be shown on the home screen

9.7.1

Improved:

Dictation now also works in Mimestream and Microsoft Office apps. If you notice it does not work in a specific app let us know at support@macwhisper.com
The default OpenAI model is now GPT4-o mini
The dictation feature will now use the OpenAI model you have enabled, instead of always using GPT4o
Fixed an issue where YouTube videos were not downloading because the 'High Quality' setting was turned on. We disabled the setting for now.

9.7

New:

Auto Translations. You can now automatically translate transcripts in specific languages. Add the languages you want to translate, and what languages to translate them into from settings.

Improved:

Added a toggle to Global Find and Replace to only match on complete words or also on parts of words. Disable this for languages that don't use spaces to separate words such as Chinese.
The Global window now shows above all other apps even if you disable float on top
When performing batch transcriptions, you will no longer see the transcripts happening in the background.

9.6

Improved:

Added support for the new GPT4o 2024-08-06 model which is cheaper and has 16000 output tokens so it can be used for large summaries
The AI response view now shows markdown text properly
Small fixes around dictation and prompts

9.5.1

Improved:

Added a toggle to show Global on top of all other apps and keep it there until you manually dismiss it (thanks Christopher)
Global can be dismissed with Escape (again)
Fix for the manage models screen popping up on every launch if you only had WhisperKit models downloaded (thanks Anthony)
Fix for the dictation feature asking for an OpenAI key even if you weren't using a prompt
Fixed an issue where export styles were not displayed correctly in the quick export menu (thanks Steven)

9.5

Improved:

Improved the preview of Markdown exports
Removed the option to export a full transcript as Markdown as it was not working
Fixed an issue where no spaces were shown in your dictation if you used a prompt (thanks Roel and Corey for reporting)
The currently active prompt (or lack thereof) in dictation mode is now persisted across dictation sessions

9.4.1

Improved:

Fixed an issue where the audio from the previous transcription would play when opening multiple files
When using ChatGPT Prompts with dictation mode, the results are now typed as it comes in

9.4

New:

Clicking the menubar icon will now open the Global window for a richer experience
Support for Markdown (.md) exports

Improved:

Dictation works in a lot more apps now and is more reliable
The Dictation overlay is presented in the right place almost all the time now. If it doesn't let us know!
When you add Whisper Transcription as a login item it will automatically be hidden when launched at startup of your Mac
Batch files are now transcribed in the correct alphabetical order
Export button is shown again when opening .whisper files
Show model tags for downloaded and downloading model cells
Buttons to quickly go to all models in the downloaded models screen
Fix for activating WhisperKit models from the Manage Models screen
Global now closes when you select “Open in main app”

9.3

Dictation improvements:

If a blank dictation is detected, show an error in the overlay
Rewritten typing engine, which is more robust (allowing streaming in future)
Improved support for Safari and Firefox for Google Docs and ProtonMail
Support for BBEdit 14
Support for more electron apps

Fixed:

Anthropic prompt never stops showing loading
Export button is shown again when opening .whisper files
Segment textfield: ⇧+return should always split the text instead of unfocusing
Segment textfield: Allow splitting of line even when half the split is empty string
Segment textfield: Can press Up key on empty line to go to previous segment
Removed flag emojis

9.2

New:

Added support for the new GPT-4o-mini model which is very cheap and still allows 128.000 tokens

9.1

New:

The app is now localized into Dutch, more languages coming soon

Improved:

Fixed some localization issues
Models are now sorted in the right order in the model picker dropdown
Global and menubar no longer save recordings to your preferred folder
Improved the performance on the global find and replace settings screen if you have a lot of replacements
Stopping global with Escape or the keyboard shortcut now correctly ends the recording
When a transcription is empty this is now also shown on the segments page

9.0

This update introduces the long awaited Dictation feature! You can now access MacWhisper's high quality dictation feature in any textfield on your Mac! Set up a keyboard shortcut from settings to get started.

Besides regular dictation you can also combine it with AI prompting. Automatically let ChatGPT rewrite, translate, clean up or convert whatever you dictated into the format you prefer.

Dictation is available to all users for the next few releases while we make sure it all works well. Please let us know if you run into anything! Jordi & Ian

New

Dictation! Access high quality transcription anywhere on your Mac
Added support for .m4b and .flac files

Improved

When exporting srt subtitles with speakers, there is now a space after the “:”. (Thanks Marabel!)
API keys are now hidden in settings by default and look nicer
The model download page is a lot simpler and clearer
The language selector in batch settings looks correct again
Updated to the latest version of WhisperKit
Updated ignore word list based on your feedback, thanks!
Fixed an issue with Anthropic Claude 2.1
Fixed an issue where microphone recordings could not be used with the OpenAI cloud transcription feature
Global view is now always on top so you can keep using other apps while the transcription is active
Global and menubar features now also work with cloud transcription

8.11 New

You can now use the Cloud transcription feature for all parts of the app. Just select it from the model picker dropdown.
Export translated transcripts from the export dropdown. When using the export dropdown with a translated transcript you can now choose which version of the transcript is exported.

Improved

Fixed an issue where some old .whisper files could not be opened.
The app shows a progress bar when loading models now.
Decreased the size of whisper files with audio by converting the audio track of videos to m4a.
Videos automatically show up in picture in picture mode after loading.

8.10 New

You can now import and export your global replace list. This makes it easier to share common word replacements with friends or colleagues.

Improved

Loading .whisper files with large video will now be 99% times faster since we first load the text and then later the media file!
Find and replace settings now looks nicer and the filter works better
When adding a new speaker the name textfield is auto selected to make your workflow faster
Extra checks for validating your DeepL license key
Fixed an issue where YouTube videos were not downloading if you had “Download video file” enabled

8.9.1 New

Added support for the latest Anthropic model Claude 3.5 Sonnet
You can now adjust the starting timestamp for the transcripts. Useful if you’re working with timecoded files. For example, you can now set the starting timestamp of the first segment to 01:00:00 and then all other timestamps will update accordingly. You can access this setting from the menubar > Transcript.

Improved

When loading a large .whisper file you can now cancel the loading process
Fixed some keyboard interactions and sound effects that were happening when they shouldn’t have
Fixed some memory leaks
Improved performance when switching between display modes
Updated the ignore word list based on your feedback
Changes in the segments view flow over to the AI view
Hitting Shift-Return now creates a new segment. Before this was done with just Return, which will now commit the change in the textfield.
Fixed a crash in system app audio recording
The segments view animations are less messy when searching

8.8

Improved

Fixed an issue where audio decoding could fail when using WhisperKit models. The app now falls back to an alternative audio decoding strategy.
The AI features now use the correct token limits for each model

8.7

New

Added a "Replace" button for replacing individual words alongside "Replace All".
Select all segments up or down from the current selection using ⌘+⇧+⇧ or ⌘+⇧+⇧.

Improved

Fixed: “System Audio Recording” and “Transcribe Podcast” sometimes didn’t display all tracks in the finished transcript.
Fixed: Crash when starting Global transcription via ⌘+R with an ongoing transcription.
Find and Replace now supports Undo (⌘+Z).
Models causing crashes won't be automatically loaded on next launch.
Corrected timestamp duration when backspacing to join two segments.

8.6

New

Added emojis for each language so they’re easier to find
Fast switching between translated languages in a transcript. Just tap the the flag to switch to another language.

Improved

This update has a lot of design tweaks. Nicer buttons with prettier corners, cleaner header bar and playback bar
Cleaned up the close buttons
Added some more colors
The url input bar shows an icon depending on what type of url you paste in

8.5

Improved

The menubar app will only copy your transcript to the clipboard when it’s finished
Clearer errors on the upgrade to pro screen if you don’t have an internet connection
Improved handling of in app purchases when you are offline
Fixed a crash when trying to add more than two speakers when not a Pro user (thanks Tom!)
The language and quality badges are shown again in the header bar, oops!
Added more words to the ignore list based on your feedback, keep submitting them please!
More improvements to .ogg support

8.4

Improved

Fixed an issue where the devices list was not updated correctly when a microphone was disconnected.
System App Audio recordings are now automatically merged before being saved in your history.
System App Audio playback now plays back both audio tracks correctly.
Cleaned up the design for the System App Audio screen.
When you close the app you are recording, you can continue it later.
Improved an issue where some people were getting a “Runner not ready” error.
You can now control more settings related to WhisperKit.
The menubar icon will now show a different icon while transcription is happening.
If you’re exporting a CSV with speaker names but a column doesn’t have speakers, it’s now empty instead of filled with a timestamp.
Fixed an issue where .ogg files weren’t working for people (thanks Raymond!).
Added the option to choose if you want to save only the audio or also the video file when making a whisper file for a video.
Some small design tweaks.

8.3

New:

You can now quickly toggle the “Translate to English” option from the Transcript menubar (⌘T)

Improved

You can now open the find and replace bar from the menubar or with a keyboard shortcut (⌘⌃F)
If you have the “Translate to English” setting enabled it will now be shown as a badge on the transcript view as a reminder
Fixed an issue where the headerbar did not have the correct padding
The language selector now shows languages you have not used before in a deeper menu to clean it up a bit
You can now change the input language from the “Whisper” menubar item

8.2

New:

Added a new quick export dropdown menu next to the Export button for even faster exporting

Improved

Fixed some of the names for specific models
Improved the Ignore Segments feature and made it only available for Pro users
The language and used engine are now shown when opening .whisper files
Added a warning before specialising WhisperKit models if your Mac is on low power mode

8.1

New:

Ignore Segments. Commonly used filler segments can now automatically be removed to clean up your transcripts. And you can also add your own from Settings > Ignored Segments.
Right click a segment on the Segments view to ignore it in the future
Word Level Timestamps. You can now split up segments per word when using Whisper C++ models. Enable it from Settings > Advanced.

Improved

Video files with capitalised file extensions now also show the videoplayer (thanks yuniancong)
The timestamp for the first segment now starts at 00:00:001 for better compatibility with srt files
WhisperKit is now disabled on Intel Macs and on Macs running Ventura

8.0

New Features:

📺 Video Player: Now, when transcribing video files, an inline video player is available! It can also be popped out into its own window. Subtitles display directly on the video, and translations appear as separate subtitles too.

🏎️WhisperKit Support: Choose different Whisper engines for your transcriptions. WhisperKit offers distilled models for speed, and transcriptions stream in real time. Enable WhisperKit in Settings > Advanced.

Improvements:

If you have a character limit set, the app will not cut off words in the middle of a word.
New menubar icon that doesn't conflict with the standard microphone icon.
Quality and language selectors moved to the toolbar. Expand your window if they're not visible.
Opening .whisper files is now possible while models load.
Updated to the latest Whisper C++ engine, now with Flash Attention (activate in Settings > Advanced).
Redesigned Manage Models screen for easier model selection. Feedback is welcome.
Enhanced error handling for model downloads.
"MS Teams Virtual Mic" excluded from microphone options as it's not an actual mic.
Fixed a bug where invalid license error codes weren't displayed.
Resolved a crash when non-pro users added more than two speakers.
The Esc key won't close screens during active processes like recording or batch transcription.

Global:

Keyboard shortcut modifiers now displayed in the UI (⌘+R etc).
Improved button design.
Fixed transcript copy errors.

YouTube:

Faster YouTube downloads.
Option to download only audio or video from YouTube.
Downloads play in the mini-player.
Choice of high or low video quality.

Cloud Transcription:

The Cloud Transcription feature now only lists the languages that are supported (57) compared to the 100 that are supported locally
Fixed the bug where m4a/mp4 files were being rejected even though they are supported
All the file formats that the local transcription mode supports are now supported for Cloud as well

ChatGPT:

Now always shows the network error if there is one.
Support for latest models (latest GPT-4 Turbo, and GPT-4o!)

7.13

Improved:

Fix phantom window being opened on Ventura when opening a file from Finder.
Fix issue with nothing happening when opening a .whisper file from Finder on Ventura.
Present error if there’s an issue when restoring a purchase.
Remove “copy all” keyboard shortcut override (⌘+C) on Transcripts screen, so that you can still copy a text selection
Commit pending textfield change when opening Export sheet, to ensure that an export contains the most recent changes.
Add a note for Cantonese language import, stating it works best with the Large v3 model.
Reset AI output when running a successive prompt.

7.12.1

Small bug fixes

7.12

New:

Added support for Anthropic Claude as the AI provider. Use the powerful Claude models to perform AI features on transcript with up to 200.000 tokens.
Enhancements to the translation process during export. Now, on the export page, you can choose which language you wish to export to.
Translations are now stored in your .whisper file.
Laid the groundwork for making the app available in other languages.

Improved:

Introduced a button to facilitate translating the app into your native language. It will remain accessible until we've recruited sufficient translators.
Eliminated a flicker observed when initiating a new transcription.
The translation button now displays the currently visible translated language for clearer language identification.
Implemented comprehensive undo/redo functionality for translation entries.
Bugfix on Undo when switching files
"Combine Segments into Sentences" also supported for translations now.
Improved the check for active Pro subscriptions. If you run into issues with your Pro license please contact us.

7.11

New

Combine Segments into Sentences: Under "Transcript → Combine Segments into Sentences," you can now transform your fragmented segments into complete sentences, while preserving timestamps and metadata like favorites and speaker assignments.
Audio Track Saving: When transcribing videos, now only the audio track is saved in the .whisper file, reducing file size and loading times.
Exporting Segments: You can now choose to export segments with or without including milliseconds.
Added compatibility for .aac audio files.

Improved

Resolved an issue where .whisper files with an embedded .mov would not play back audio.
Microphone Selection is saved: The app now remembers the last-used microphone selection correctly.
Fixed sentence grouping in HTML and PDF exports.
Cloud Transcription: Increased network timeout duration to accommodate longer transcriptions.
Fixed a crash when using the "File → Export → Whisper" menu option.
Media is added to history even if transcription is cancelled.
DeepL API Key Error: Now shows an informative error if the DeepL API key is invalid or expired.
Resolved a UI glitch in Batch Settings.
Disabled the close button on the manage models screen while downloading.

7.10

New

"Record App Audio" and "Transcribe Podcast" features now support playback of all recorded audio tracks.
Additionally, there is an option to export a merged audio track combining all recorded tracks.
ChatGPT: Introduced support for gpt-3.5-turbo-0125.

Improved

Addressed a crash on Intel Macs during "Record App Audio" sessions. While we continue to investigate the root cause, recordings on Intel Macs may instead stop with an error within the first five minutes.
Resolved a bug in macOS 14.4 where the main MacWhisper window disappeared upon opening a .whisper file from Finder.
Provided a link to create an OpenAI API key.
YouTube: Expanded support for audio streams, increasing the likelihood of successfully downloading from a YouTube URL.
YouTube: Enabled downloading the same YouTube video twice without a “duplicate file” error.
YouTube: Improved error messaging to clarify the cause of download failures.
Added visual indicators for keyboard shortcuts to adjust playback speed (⌘+ to increase and ⌘- to decrease).
Ensured that dismissing a Batch transcription properly resets it for future use.

7.9

New

Added support for .ogg and .opus files
You can now translate files into multiple languages

Improved

Improved the translation feature. You can now choose to automatically translate the full transcript (with context) and the segments individually as well. Or you can choose to only translate the active display mode
Removed the emoji on the podcast view
You can change the colors again on the podcast view
You can now open whisper files made with pro models even if you are not a pro user

7.8.1

New

Cloud Transcriptions. You can now choose to use the OpenAI API version of Whisper to transcribe your files. This will run the transcription on their fast servers at the highest quality. Great for if you’re using an older Mac. Note that this requires an OpenAI API key, and will thus cost money. Files sent to OpenAI are no longer only stored locally on your Mac so be aware of this for private recordings.
You can now choose which microphone is used in Global and the menubar app
Added a tile on the homescreen to open the batch transcription feature
Added keyboard shortcut hints in Global mode
You can now adjust timestamps for segments. Click on the timestamp to adjust the start and end time by whole seconds. This is still very early so let us know what you would like to see us improve!

Improved

You can now copy your entire transcript with a keyboard shortcut (⌘+C)
When copying your transcript it will now take into account if you are in sentence or full transcript mode.
When autosaving you will not see “Saving…” anymore in the header bar
Audio playback stops when closing the transcription window
Merging multiple segments into one now happens in the correct order (thanks Nathan!)

7.7.1

Fixed

Fixed an issue that would prevent the app from saving

7.7

New

Changes to .whisper files are now automatically saved when you go back to the main screen
When you’re editing .whisper files, the app will autosave changes every 10 seconds
You can now pause microphone recordings
You can now merge multiple segments into one by selecting multiple and then selecting “Merge Segments” from the context window
You can now ⌘+ tap on the textfield in a segment to select the segment itself instead of the textfield

Improved

Saving and opening .whisper files is now 500% faster
YouTube transcripts now use the video title as the filename
YouTube transcripts should appear even faster
When the whisper model can’t be loaded the alert will show the error code for easier debugging
You can now change the “Show Timestamps” and “Large Font Size” settings from the View menu in your menubar
Fixed an issue where the English only models could still use the last language used that was not English
Improved the performance when showing large transcripts
When you make changes to the segments they now show up in the ChatGPT display mode

7.6

New

You can now choose which OpenAI model is used for the ChatGPT screen (GPT 3.5 or GPT 4)

Improved

Cleaned up the some UI glitches in the batch screen
Fixed an issue where the batch screen may not always appear
Added an extra explanation to the Auto detect language button that explains how the language is determined
Fix for custom model-loading crash, and strategy to prevent in future
Added a retry button to the model downloader screen in case the list of models could not be downloaded
If batch transcription fails the app shows a clearer error
Small design tweaks on the System App Recording screen
YouTube downloads work more reliably and should not error out

7.5

Improved

YouTube transcriptions should download a lot faster now
Improved transcription quality and performance on large audio files (30+ minutes)
The back and forward skip buttons now both work in their correct direction
Fixed an issue where the progress and scroll back buttons would overlap
You will no longer see an error when you cancel a 'Save As...'
Make it clearer that on first use of the recording feature you have to choose a folder to save the files to

7.4

New

You can now drag in your own custom GGML models to use (Pro only). Use this with custom models trained on specific languages or datasets. You can download these from sites such as Hugging Face.

Improved

When you add a new segment by hitting Return at the end of the line, the new segment will be automatically focused
When the microphone is unplugged during a recording your recording is saved properly and you see an alert notifying you about what happened
The milliseconds display when timestamps are visible now uses your Locale settings to determine if a period or comma should be used.
When you scroll the segments page during playback, autoscroll will briefly be disabled and a button appears to enable it again.
Fixes and improvements to playback performance on segments view.

7.3.1

New

You can now rename the audio files in a System App Recording transcript before the transcription starts. This way you can tag your microphone audio as you, and for example the Teams audio as the name of a colleague
You can now favorite or unfavorite a selected segment (or segments) by hitting the F key.

Improved

Fixed an issue where files could not be saved if you did two transcriptions in the same minute
Small tweaks to the transcription screen if no transcripts were generated
The current selected segment is no longer unfocused if you assign a speaker to it with the keyboard shortcuts
The global find and replace feature is now case-insensitive by default. You can still toggle this on if you prefer that behaviour.

7.3

New

MDM deployments can now disable remote features such as translation and ChatGPT
Audio files from system app recordings are now saved to your history for easy access
Microphone recordings are saved to your history as well
You can create an empty segment after the current one by pressing Return at the end of the textfield
You can skip forwards and backwards in the audio by 5 seconds with the mediakeys on your keyboard

Improved

Fixed an issue where opening a .whisper file could present an alert but still open the file correctly
You can now save active .whisper files with Save, without being prompted to overwrite the file
Fixed an issue where system audio recordings would stop without an error when putting your Mac to sleep
Fixed an issue the system audio recording screen could not be closed
Fixed an issue where the Pro status was not checked correctly which could lead to some users not being able to access a Pro feature
Search highlights now appear in yellow for better legibility
When editing a segment, it will stay active even if the highlighted segment that's currently playing changes
You can transition from one segment to the next by pressing the left and right arrow keys at the start or end of a segment
Improved then connection to OpenAI for the ChatGPT feature to prevent timeouts for long transcripts
You can now change the playback speed by hitting Cmd - + and Cmd - -
You can go back one step in playback speed by holding shift and pressing the speed button
Disabled the copy button on the Export screen for binary files such as pdf and whisper to prevent crashes

7.2.1

New
Assign speakers directly with keyboard shortcuts (⌘+1 etc)
Improved
The app shows the detected language when using the Auto Detect language setting
Fixed an issue where (1) was wrongly being added to an exported file even though there was no file with the same name in the directory
Translated text now also shows as sentences if you enable the sentence mode in the Segments display mode.
Fixed an issue where the export format wasn't displayed correctly in the batch export screen
Available microphones in the picker are now shown faster
When you disconnect your microphone during a recording, the recording is saved for transcription still
After you delete a segment, the next one is selected for easier editing
Batch files are now sorted alphabetically
The model quality is now shown while a transcription is active

7.1

New
You can now add your own custom prompts to use with the ChatGPT feature
Improved
Improved the quality of longer transcriptions.
Your last used export method is remembered
You can group the full transcript export by sentences again even if you have not added any speakers
Fixed export for docx in batch mode
Batch exports will no longer overwrite files with the same name but will add (1) instead
The prompts you use will be sorted by most recently used

7.0

New
ChatGPT integration! Add your own OpenAI API key and process your transcripts directly with ChatGPT. This is an early version, so I would love to hear your feedback! It requires you have access to GPT4-Turbo in this version.
You can now use the new export styles in batch mode as well
Improved
You can now export again from the menubar, using the new styles
The speaker paragraphs export option works again even if you have not added any speakers
When batch exporting, the app will no longer overwrite existing files with the same name.
You can now choose how you want to group the full transcript export (full, segments or sentences)
System App Audio recordings will now stop if you close the app that you are recording

6.11

New
New and improved export screen. You're now able to customize what your exports look like in more detail. More improvements are coming over the next few weeks.
Improved
Fixed a crash when trying to overwrite a file during export
You can now close the Global window by using the Escape key
Added a nicer gradient to the top of the home screen
Added some shine animations to the home buttons when hovering over them
The Global view will no longer show transcripts created in the main app

6.10

Improved
When you enable Auto Start for Global mode, you are now able to start a new recording immediately when opening it again. Before it would not show the back button.
Scroll indicators now are positioned correctly on the home screen
Transcripts created in the main app are no longer shown in the Global window
The settings window will no longer ask you to save you transcript before closing it

6.9.1

Improved
Add a button to show the sidebar in settings and added titles for each section

6.9

Improved
Redesigned the settings screen to provide more space for future features.

6.8

Improved
The app now stays open when you close the last window again

6.7.1

New
You can now adjust the Whisper temperature from Advanced settings
Improved
Set the max beam size to 5
Gave some Whisper errors clearer descriptions

6.7

New
Added the option to use Beam Search instead of Greedy for improved transcription results. If you are running into duplicate segments, give it a try and let me know if it solves your problems.
Improved
Made some error messages clearer
Fixed an issue where sometimes a file directory could not be opened
Fixed an issue when splitting a segment

6.6

Improved
Improvements to the Global experience, thanks for the feedback
The support buttons on the homescreen buttons work again

6.5

New
You can now open files and folders in MacWhisper directly from finder. Right click > Open with...
Improved
Removed the filename from the toolbar for now to not push the extra buttons to the more menu where they don't work

6.4

New
New Global mode! Access high quality transcription from anywhere with a keyboard shortcut. A spotlight like window will appear where you can immediatly start recording. The finished transcript can then be (auto) copied to your clipboard for easy pasting anywhere on your system.
Press backspace to delete a selected segment in the Segments view
You can now search through the Global Find & Replace list
Improved
Fixed padding and design issues on the System Audio screen
The microphone recording screen now loads a lot faster
Added a new experimental way to decode audio files that are giving problems. You can enable this from Settings > Advanced
Transcriptions made in the menubar app no longer automatically show up in the main app

6.3

New
Added support for the new Large V3 model for even higher accuracy
You can now change the language and model you want to use in the System Audio Recording screen and the Batch Transcription screen
Improved
Moved the progress bar into the header for a cleaner look
The filename of the currently open file is now shown in the toolbar
The time remaining and total progress is now shown during batch transcriptions
Added a way to send Diagnostics Reports to us to help solve problems in the future
The “Manage Models” button is now accessible again from the selector in the bottom left of the main screen
The export format selection buttons are more easily tappable
Fixed the counter in find and replace to be accurate
Added the option to manage your Pro subscription

6.2

New
The progress bar now shows an estimated time remaining alongside the progress percentage
You can now add a prompt to use for your transcription. This can help the app to better understand the context of the audio file you are transcribing. Examples of prompts would be: "this is a conversation between two English people" or "this is a conversation about rockets, words used are [ROCKET RELATED WORDS HERE]". You can find this feature under Settings > Advanced
Add an option to hide milliseconds in the timestamp view on the segments page
Added support for Undo and Redo in the segments view
Improved:
Extracted whisper files are removed from the temporary folder after they are opened
Created wav files used during transcription are removed from the temporary folder when the transcription is finished
It is now easier to open the audio files that are recorded during a System Audio recording.
When searching in the transcript view, the page will scroll to the first occurence of the word and not just highlight it
Added help text to the homescreen buttons to make it clearer what they do
Added a button to open the Manage Models screen from the main page.
On the Record System Audio screen the list of open apps is now updated live when you open or close apps
You can now press the Return ⮐ key while you have a segment selected to edit the text
You can hit the Escape key to unselect the text
You can now navigate between selected segments with the arrow keys on your keyboard
You can now select multiple segments and then hit ⌘+C to copy them as a whole to your clipboard
Whisper files now open faster, especially the first one you open
Redesigned the history screen to look nicer. More improvements to this are coming soon
Fixed an issue on HTML export for batch transcriptions where the title was not correct
Filter FaceTime (because its audio is not available for privacy reasons) and MacWhisper from the available apps to record on the System Audio screen.
Fixed a small gap between segments when timestamps were enabled

6.1

Fixed an issue where performance on Intel Macs was slow
Fixed a design issue on Upgrade to Pro view

6.0.1

New:
- Metal support! The transcription process now runs using your GPU with the Metal framework. Especially on Apple Silicon Macs this leads to 2 to 3x speed improvements! Let us know if you run into anything related to this.
- You can now play and pause audio playback by pressing the spacebar
- You can use your media control buttons to control the audio
- Added a fast way to export from the menu bar. File > Export...
- MacWhisper will appear in Control Center when playing back audio
- Added support for Cantonese
Improved:
- Fixed the Hebrew language setting not working correctly in some cases
- m4v files can be opened again
- Audio files with audio panned to either the left or the right will work properly now

5.7

New:
- Added support for notifications to remind you when a transcription has been finished.
- You can now drag in (multiple) folders to perform batch transcriptions.
Improved:
- The quality of transcriptions should be better for certain files. If you were seeing repeated sentences, please let me know if this update fixes it.
- Made it clearer that the language you select in the bottom left is the input language of the audio that you want to transcribe.
- You should now be able to cancel an ongoing transcription without having to wait a long time.
- Made the app 1MB smaller by removing some very large wallpapers that were only used in small sizes.
- Show a "save confirmation" alert in more situations to prevent data loss.
- Fixed an issue where the speed toggle didn't work for a very small number of people, let me know if it still happens for you.

5.4

New:
You can now add more files to the batch transcription window after it's been opened.
Improved:
You can now translate from the Segments view as well. Use this if you want to export your translated transcript.
You can now split a segment by just pressing return. Hit shift-return to commit or click outside of the segment
If you remove all text from a segment it will be deleted automatically
While splitting segments the cursor will automatically move to the correct segment, making it easier to control with just your keyboard
The "transcribe podcast" and "transcribe recording" buttons can no longer be tapped multiple times leading to strange behaviour in the app
If you have enabled "play sound when finished" it will now also work on the menubar app
The search/filter no longer uses the full url of the file but just the last part
Increased history size from last 50 to 200

5.3.1

Fixed an issue where transcripts would not complete if the Remove Duplicates features was turned off.

5.3

New:
Sometimes the transcription framework would return a lot of duplicate segments. This should no longer happen. You can disable this feature in Settings > Advanced.
You can now set a maximum character limit for segments (useful if you want to adhere to the BBC subtitles size for example)
You can now choose the location where recordings are saved
You can now translate segments as well with DeepL
Added support for all DeepL languages

Improved:
The app will now prompt you to make sure you really want to close a window or quit the app if it could lead to data loss
Fixed typing glitch associated with committing a change
Fixed an issue where colors were set incorrectly when switching between dark and light mode
When searching you can now automatically scroll to the selected rows
Fixed an issue where the segment highlighting would jitter while transcription was still being finished
Improvements to splitting segments
Fixes to selecting text in segments view

5.2

Fixed an issue where sometimes the transcription would get stuck at 0%. Thanks for letting us know if you ran into this

5.1

New:
Split up segments! Press shift and return in the middle of a segment to split it up into a new line. More improvements coming to this area soon!
Sentence view is back! On the transcripts page you can now click the sentence button in the top left to display your transcripts in a more structured formatted
New save audio button to more easily export the audio file associated with a transcript. Click the waveform icon in the top left.
You can now delete all occurrences of a segment by right clicking. Useful if there's some repeated sentences. We'll add more options to auto remove these in the next two weeks.
Improved:
Fixed an issue where the File menu did not show save or open buttons
The pro upgrade screen can now be presented more consistently from the menubar app
Searching through transcripts is now a looooot faster
You can now open .whisper and audio/video files from each of the "open" flows in the app

5.0.1

Menu bar app! Quickly dictate recordings from the menubar app and copy them into any textfield you need.
View how long it took to transcribe the file after the transcription is finished
Quickly open the original audio file for a (microphone) transcription by clicking the waveform icon in the bar at the top
Added advanced settings to disable the confirmation alert when closing a transcript that has not been saved yet
Loading .whisper files is faster now
The progress bar is no longer shown when loading a .whisper file
More consistent design for buttons in the header bar
Files that are no longer available are automatically removed from your history
The settings button now also works on newer operating systems
Fixed an error when pasting a non-url into the url download bar
You can now right click on the text as well in the segments view if you want to perform an action on that segment
Fixed an issue where in system audio recording multiple recordings would stack
Fixed an issue where you could not save a system audio recording (right now only your microphone audio is saved in the .whisper file)

4.6

Added a button to unlock all features on the home screen if you're not using MacWhisper Pro

4.5

If you click the back button from the transcription page without saving, an alert will pop up to ask if you want to save your transcript.
The DeepL translation now takes into account punctuation (pretty silly we overlooked that) and works with more languages now.

4.4

Fixed an issue where the microphone recording was not transcribed during System Audio Recordings.
When "Play sound when finished" is enabled, the app will now only play the sound at the end of batch, podcast and system audio recordings and no longer for each file.

4.3

Fixed an issue where new users couldn't see the language and model quality selectors. Sorry about that!

4.2

New Home Screen design
History now shows up to 50 of your last used files
You can now search through your history
The speed selector can now be long pressed to select a specific speed
The used language is now presented in the top bar after a transcription is finished
Added support buttons to each section which give you more info on how to get the most out of the features
The system audio page now looks more in line with the rest of the app
Find and replace does not show “0 matches” when you have not entered any text yet
The translate button now shows even if you have not added your DeepL api key

4.1

Fixed a crash when searching for words in a transcript
You can now switch display modes while a file is being transcribed
Speakers are now available on the top level menu when right clicking a segment instead of in a sub menu?
You can toggle the auto scroll feature in the segments mode from the menu bar Transcripts > Toggle Auto Scroll
The record feature will now remember the last input device you used and will default to that if it’s available
Added a speaker limit of two for non Pro users

4.0.1

The focus for version 4.0 was performance and speed! We spent weeks rewriting the segments view in the app so that it scrolls fast even for super long transcripts.
You can now edit all text directly in the segments mode, without having to first click on a text label
Right click on the background of a segment to favorite, add speakers or delete the segment
You can now fill in any video or audio url to transcribe them directly, not just YouTube urls
Double click a cell to start playback from it
Improved scroll to segment performance while playing a track
You can now save your audio recordings from the File menu
Toggle timestamps on or off for the Segments view from within settings
YouTube made some changes under the hood so some videos might not be able to be transcribed
Fixed an issue where the textfield for adding names for podcast hosts would lose focus while you were typing

3.5.1

You can now export transcript to a .whisper format from the export / batch screens as well. Useful for if you want to transcribe files and save them as a .whisper file which contains the audio AND the transcripts and edits you've made. You can also use File > Save as... to achieve the same thing
YouTube downloads are significantly faster and now shows a progress indicator so you know how long a download will take.
Added a view to compare the quality of the different transcription quality models. This will help you decide which model you need for your purposes.
Added the option to save batch transcription exports to the same folder as the files you selected.
A bunch of quality of life improvements and design tweaks
The translation feature now supports more input languages such as Japanese and Russian
The models should now be able to be downloaded more reliably if your network has restrictions (such as a VPN or work network)
Fixed an issue where exports in paragraph view did not match the preview
Fixed a lot of small stuff that were noticed by Harrison!

3.3.1

This update fixes a bug where the app could crash when selecting a microphone in System Audio recordings. Sorry about that and thanks for letting me know so quickly so I could put out a fix!

3.3

Improved performance when viewing large transcripts. Scrolling should be snappier now.
Advanced Translation! You can now translate entire transcripts by using your own (free) DeepL API key. You will need a free (or Pro) DeepL API key, it's very easy to set up and you'll get 500.000 free translation characters every month. Be aware that when you translate your transcript, the content is sent to the DeepL servers for translating. This feature requires Whisper Pro. Right now you can translate into six different languages, but more are coming soon. Please send me any feedback on how you want to use it.
You can now select which input device to use for your Microphone Recordings or System App Audio Recordings
Made Find and Replace clearer

Improved: - Improved performance when viewing large transcripts. Scrolling should be snappier now.

3.2

Improved the recording screen experience.
Your recording audio volume is now displayed to make it clearer that your microphone is picking up what you're saying
Fixed an issue where audio you recorded could not be played back on the transcription screen
If you have denied permission for the System App Audio Recording feature, the app will now redirect you to the settings when you click on the menu option
WebVTT exports now display the speaker names in front of the transcript if you've added a speaker

3.0

Starting with MacWhisper 3.0, new updates will only support macOS 13.0 (Ventura) and up. I had to update my test device to the new Sonoma beta, and it's very hard to support older versions of macOS while keeping my sanity. You can continue using up to 2.21 on Monterey for as long as you need. You can download old versions of the app from the Gumroad page.
New display mode which shows the full transcript as one long piece of text. Thanks for all the requests!
Made it easier to toggle between display modes
You can now choose where to export batch transcription files to
Batch export to DOTE file format
Export to full transcript text file

2.20

Podcast Transcriptions! Easily transcribe your podcast by providing audio files for each host and MacWhisper will automatically transcribe them, separating each speaker's dialogue. Please keep in mind that this feature is still in beta testing, so you may encounter some issues. This feature will be Pro only starting in a later release, and is only available on macOS Ventura and up.
Save and load your transcriptions in a .whisper file format! You can now save transcribed files as .whisper files which you can easily share with others. They will include the audio file as well, so they can open them as if they made them themselves! Let me know if you run into anything!
You can now play a sound to be notified when a transcription is finished
Added a settings screen which you can access from the toolbar, menubar or by pressing "Cmd + ,". You'll find some common settings there that used to be in the toolbar.
Improved the design of the batch settings screen.
You can now access your recently used apps more easily for system app recording
Your recently used languages are now shown at the top of the language picker list
Greatly reduced memory usage when using different models in the same app run
Fixed a bunch of small bugs here and there
Editing segments performs a lot better now
Show icons in the history

2.17

Fixed an issue in System Audio Recording mode where the audio for the app recording would fail.

2.16

Fixes an issue where the microphone recording during System Audio recording could keep recording after you finished.
Microphone recordings are now saved with unique names to your Documents directory instead of as output.wav

2.15

Up to 40% speed boost! MacWhisper can now use all the CPU cores on your Mac. For M1/M2 Pro/Max computers this should result in around 40% faster transcription!
Added initial implementation for recording system audio from apps. This features is only available on Ventura because it uses APIs that are only available on Ventura. This feature will become for Pro users only in a later release. There will be bugs, so please report them to support@macwhisper.com.
Rewrote the foundations for the model downloader so they should fix issues with downloads
If a model download stops halfway through you can resume it later
Models are now grouped based on if they're English only or Multilingual
Added support to export to the DOTE transcription format
New app icon!

2.13

Manual speaker selection! You can now add speakers from the toolbar and then right click on single or multiple segments to add speakers. This is still very early and work in progress so please send me feedback :)
Batch export now works properly on Monterey
You can now export to multiple formats at once when performing batch transcription
Global Find & Replace can be accessed from everywhere
Hopefully fixed an issue where the cursor would jump to the end of the segment when editing the text
You can now adjust the text size

2.12

Batch Transcription! Drag and drop multiple files on MacWhisper to transcribe and export them one after another. Great if you need to transcribe a large number of audio files at the highest quality. You can just leave your Mac running overnight and wake up to fresh transcripts. This feature is available for MacWhisper Pro users only.
Added keyboard shortcuts to quickly open a file or start a recording (Cmd+O and Cmd+R) from the start screen
Some small design tweaks here and there

2.11

Transcription will now continue at full speed even if you run MacWhisper in the background!
Made it clearer that you can not close the model downloader screen until you've downloaded at least one model or if you're downloading a model.
Added buttons to open the Finder location for the downloaded models.
Global Find and Replace. Add words or phrases to be automatically replaced in new transcripts. This can be helpful for accents or names. Note: Right now the system replaces the text wherever it's found (even within other words) so, for example, replacing “you” will also replace the same letters in the word “your.” Access it from the settings icon in the toolbar.
You can now change the display mode for the transcript by clicking the display mode button in the toolbar
In Reader mode you can switch between showing the whole transcript as one chunk, or split up by sentences. The copy button will adjust based on your current mode.
The home screen now provides quick access to your three last used audio files (for now they will need to be re-transcribed each time, working on a way to save completed transcripts so you can continue working on them)
Clicking on a segment will now no longer play from that location but instead will let you edit. You can play a segment by clicking the play button on the right side
While editing a segment the text will no longer under/overlap with the buttons on the right side
You can now export to HTML, as in, the button works
You can now export to PDF as well

2.10

Fixed an issue in 2.9 since that update was accidentally sandboxed (normally meant for the Mac App Store). If you downloaded 2.9 you will probably have to manually update to 2.10 by downloading it directly from https://macwhisper-site.vercel.app/releases/MacWhisper.zip. Before you update you should delete all the models that were downloaded (again) in version 2.9 as they are saved in a different directory and would otherwise take up space on your Mac. Sorry about this!

2.9

You can now transcribe YouTube videos by pasting the url. This feature is only available for MacWhisper Pro users and only on Ventura, but you can test to see if it works if you're a free user as well :). Videos are downloaded to your documents folder for now, but please send me feedback on how well / not well this works for you!
HTML Export! Export your transcript into an HTML page. This version is very early and needs some design love, but I'm not great with HTML and CSS so bear with me here :)
If you copy the transcript from the toolbar or reader view it will now be exported as individual sentences instead of one big chunk
Fixed an issue where the reader view could not be opened on Monterey
Fixed an issue where export was not working on Monterey

2.8

Fixed an issue where the app would crash on Monterey

2.7

Added a new export preview screen where you can see what the output file will look like

2.6

Favourite segments are now highlighted on the slider bar at the bottom
Find and replace. You can now find and replace words across your transcript. Note that currently it will replace all occurrences of the string you're replacing, also if it's part of a larger word. Please send me feedback if you run into anything (Ventura only for now)
The reader mode now splits up the transcript in sentences for easier, well, reading!

2.5

Added a button in the toolbar that notifies when a new version is available

2.4

Fixed an issue where the app would sometimes randomly crash when transcribing files while it would work on the next try with no issues. Thanks a lot for sending the crash reports!

2.3

When you record audio with your microphone, the app will now show "New Recording" as the title of the file instead of an empty space
You can now change the playback speed of the audio recording. Play your audio at 0.5x up to 3x speeds by toggling with the button in the bottom right of the playback bar.
After you finish a microphone recording the app won't go back to the home screen until transcriptions are displayed.
The file formats that are presented on the landing screen are no longer spoken through Voice Over.

2.2

Fixed an issue where the microphone could not be used.

2.1

Downloaded Whisper Models are now saved in the Application Support/MacWhisper/models directory and they're excluded from backups.

2.0

The app is now very small! You will have to download the different quality levels manually, but they will persists across updates. This will make it a lot easier to handle updates in the future :)
The app can now automatically update itself without you having to download it again from the website
You can now favorite individual segments. This will be useful in a later version where you can save and load .whisper files
The scrub bar now shows the segment text while scrubbing so you can more easily find specific parts of a transcript
Click on a segment to play
Drag and Drop Voice Memos directly from the Voice Memos app into MacWhisper
Edit a segment by clicking on the edit button
Design tweaks to make the app nicer to look at
Fixed a crash where 8 bit mp3 files weren't able to be transcribed
Added a warning for users with 8GB of RAM to inform them that the higher quality transcription levels might not work on their device.
Improved support for dropping .m4a files
You can show your transcript in Compact Mode which hides the timestamps