ALDB – A Python Tool for Multilingual Anki Decks with Neural Voices

Emmanuell · August 21, 2025, 9:12pm

Hi everyone ,

I’d like to introduce Anki Lang Deck Builder (ALDB), a project I developed entirely in Python. The program integrates edge-tts (for very high-quality neural voices that stay true to native accents) and genanki (for automatic deck generation) to create ready-to-use Anki decks for language learning.

It’s a 100% free and open-source tool, designed to save time for learners by automating the creation of .apkg decks with authentic audio. The project also provides a complete list of supported languages and voices, so you can easily explore the available options.

I’d like to share my GitHub repository and the link to my project presentation page, which includes a video and a screenshot. However, since I’m a new member I can’t post links yet. Could an admin let me know the proper way to share my project here?

Thank you very much!

Danika_Dakika · August 21, 2025, 11:54pm

Share the links in a preformatted/code block for now, and you can edit them later.

I moved your post to “Add-ons” because that seems the closest fit, but feel free to move it to “Learning Effectively” if you prefer.

Emmanuell · August 22, 2025, 8:00am

Hello,

Great idea, here’s the link :

You can find the demo video, screenshot, downloads, and the GitHub repository on my page.

Feedback and suggestions are very welcome.

ZornHadNoChoice · August 22, 2025, 11:25am

The voices are the major feature of your app, yet your video has background music and showcases only pronunciations of individual words.

(I personally dislike AI pronunciations of any kind, but wanted to give feedback)

Hey_Mo · August 22, 2025, 12:18pm

Those AI voices sound very good!

How is it possible to generate them in bulk? I have around 30,000 words. Is it possible to upload a spreadsheet or something similar?

Do you have plans to make an add-on for Anki? Would that even be possible? I think it could become the most popular add-on and surpass other TTS add-ons. Currently, those have the advantage of being easier to use since they’re integrated into Anki itself. Personally, I think their main convenience is generating audio directly inside a user’s existing decks.

Emmanuell · August 22, 2025, 12:52pm

Your idea of a directly integrated Anki add-on is excellent! I’m going to look into adding a system to automatically scan spreadsheets so that voices can be generated automatically from your files.

Emmanuell · August 22, 2025, 12:54pm

Unless you know a person who can generate these voices in the language you’re learning, or you prefer to use Google’s basic voice, I think my alternative is an excellent solution, as the voices are very close to real human voices.
That said, you’re absolutely right about the presentation video — I should have shown more demonstrations with long sentences, female/male voices, multiple languages, etc.
If you want, you can still give it a try and provide feedback; you’ll see for yourself that even with long (or very long) sentences, the realism is impressive. The edge-tts module is really powerful.

UPDATE:

I added to my GitHub presentation page, for each language, an audio clip (male/female) of the sentence “Hello everyone, I am an artificial voice but I try to be as realistic as possible!” translated and pronounced in all languages available in ALDB.

You can directly preview the pronunciations from the website, without even downloading the program—it will already give you an idea!

ZornHadNoChoice · August 22, 2025, 1:53pm

Since you insist…

The Arabic sample is laughably wrong. It couldn’t get the word for “hello” right!
The Japanese sample feels very “choppy”, at least at the greeting.

Emmanuell · August 22, 2025, 5:16pm

Bro, this is a free project I created for you to enjoy. The voices come from the Microsoft Edge TTS service.
I don’t provide these neural voices myself. I just did my best to build a lightweight tool that helps language learners create quality audio cards.

Danika_Dakika · August 22, 2025, 5:53pm

To be fair, you did ask for feedback. And you are touting how “close to real human voices” they are. If those aren’t languages your ear is accustomed to, you might value the knowledge that the service you’ve chosen isn’t universally great (so your tool might not be of the same quality for every language …).

Emmanuell · August 22, 2025, 5:55pm

You are absolutely right on this point.
However, my comparison is made exclusively with the commonly used service, namely Google’s TTS, which is particularly flat and sounds very “synthetic” in most languages.

Danika_Dakika · August 22, 2025, 7:12pm

I suspect that many folks who are experienced with trying to fit TTS into their language learning have found there are more fish in the sea than just those two!

Emmanuell · August 23, 2025, 6:30am

UPDATE :

Fixed the language loading bug when loading in CSV format.
Added 2 checkboxes to define whether to generate the audio file on the front or back side of the card.

sorata · August 23, 2025, 7:57am

Sounds good to me actually. It has a weird vocal fry, but probably slightly better than Google TTS. It also does sound a bit more “human” probably because of the intonation.

(on a different note, I wonder how come we have awesome LLM coders nowadays and yet TTS voices suck so much…)

ZornHadNoChoice · August 23, 2025, 8:29am

I meant that the “minasan” and “konnichiwa” had too long pauses after them. I have done very little immersion so it might be just me.

Emmanuell · August 23, 2025, 9:04am

Here, all the Edge TTS voices are listed. Can you check if there are any better Japanese or Arabic voices on this list? You can listen to these voices on their website. If you found better ones, I’ll update my software to replace them.

WGroleau · August 24, 2025, 3:42am

The first time I tried that link, it had ONLY Spanish and English. The second time, it added several other languages, but did not include Japanese or Arabic. Nor any languages (other than Spanish/English) that I have enough experience with to judge. I didn’t think the Chinese sounded good, but I haven’t done enough with Chinese to trust my judgment. I did think the “Hong Kong English” sounded more like an American!

WGroleau · August 24, 2025, 3:55am

Aha—there’s a trick to it. If you don’t see what you want on the “language” menu, go to the “Country” menu and click on whatever country is checked (ticked/selected) to unselect it. The Italian sounded pretty good. But the single example sentence is not adequate to really evaluate the quality.

WGroleau · August 24, 2025, 3:59am

Apple has had some pretty good voices for years, but even those are noticeably fake even today. And some of their voices are terrible, but haven’t changed (as far as I can tell) in all those years.

Emmanuell · August 25, 2025, 3:08pm

I’m actually thinking about making a similar tool who’ll be available directly from a website with a backend to generate audio files.

Also, with the possibility to have multiple choices of voices for a given language (I’m talking about all voices availables here : https://tts.travisvn.com/)

What do you think about this idea ?

Topic		Replies	Views
Anki Quick Text-to-Speech Insertion Software Free Premium Microsoft Natural Audio Unlimited Usage Quick and Convenient! Add-ons	0	34	December 14, 2025
Text 2 speech to learn 2 different languages Help	7	1316	May 1, 2023
Custom tts voice in windows 10 for anki? Help	13	1174	July 11, 2024
I Made a Tool That Turns Your Voice / lectures into High-Quality Anki Flashcards Learning Effectively	5	711	January 29, 2025
Batch creation of audio for a deck Help	6	3221	February 22, 2023

ALDB – A Python Tool for Multilingual Anki Decks with Neural Voices

Related topics