ALDB – A Python Tool for Multilingual Anki Decks with Neural Voices

Hi everyone :saluting_face: ,

I’d like to introduce Anki Lang Deck Builder (ALDB), a project I developed entirely in Python. The program integrates edge-tts (for very high-quality neural voices that stay true to native accents) and genanki (for automatic deck generation) to create ready-to-use Anki decks for language learning.

It’s a 100% free and open-source tool, designed to save time for learners by automating the creation of .apkg decks with authentic audio. The project also provides a complete list of supported languages and voices, so you can easily explore the available options.

I’d like to share my GitHub repository and the link to my project presentation page, which includes a video and a screenshot. However, since I’m a new member I can’t post links yet. Could an admin let me know the proper way to share my project here?

Thank you very much! :grin:

2 Likes

Share the links in a preformatted/code block for now, and you can edit them later.

I moved your post to “Add-ons” because that seems the closest fit, but feel free to move it to “Learning Effectively” if you prefer.

1 Like

Hello,

Great idea, here’s the link :

You can find the demo video, screenshot, downloads, and the GitHub repository on my page.

Feedback and suggestions are very welcome. :slight_smile:

1 Like

The voices are the major feature of your app, yet your video has background music and showcases only pronunciations of individual words.

(I personally dislike AI pronunciations of any kind, but wanted to give feedback)

1 Like

Those AI voices sound very good!

How is it possible to generate them in bulk? I have around 30,000 words. Is it possible to upload a spreadsheet or something similar?

Do you have plans to make an add-on for Anki? Would that even be possible? I think it could become the most popular add-on and surpass other TTS add-ons. Currently, those have the advantage of being easier to use since they’re integrated into Anki itself. Personally, I think their main convenience is generating audio directly inside a user’s existing decks.

Your idea of a directly integrated Anki add-on is excellent! I’m going to look into adding a system to automatically scan spreadsheets so that voices can be generated automatically from your files. :grinning_face:

1 Like

Unless you know a person who can generate these voices in the language you’re learning, or you prefer to use Google’s basic voice, I think my alternative is an excellent solution, as the voices are very close to real human voices.
That said, you’re absolutely right about the presentation video — I should have shown more demonstrations with long sentences, female/male voices, multiple languages, etc.
If you want, you can still give it a try and provide feedback; you’ll see for yourself that even with long (or very long) sentences, the realism is impressive. The edge-tts module is really powerful.

UPDATE:

I added to my GitHub presentation page, for each language, an audio clip (male/female) of the sentence “Hello everyone, I am an artificial voice but I try to be as realistic as possible!” translated and pronounced in all languages available in ALDB.

You can directly preview the pronunciations from the website, without even downloading the program—it will already give you an idea! :grin:

1 Like

Since you insist…

  • The Arabic sample is laughably wrong. It couldn’t get the word for “hello” right!
  • The Japanese sample feels very “choppy”, at least at the greeting.
1 Like

Bro, this is a free project I created for you to enjoy. The voices come from the Microsoft Edge TTS service. :grin:
I don’t provide these neural voices myself. I just did my best to build a lightweight tool that helps language learners create quality audio cards.

1 Like

To be fair, you did ask for feedback. And you are touting how “close to real human voices” they are. If those aren’t languages your ear is accustomed to, you might value the knowledge that the service you’ve chosen isn’t universally great (so your tool might not be of the same quality for every language …).

1 Like

You are absolutely right on this point.
However, my comparison is made exclusively with the commonly used service, namely Google’s TTS, which is particularly flat and sounds very “synthetic” in most languages.

1 Like

I suspect that many folks who are experienced with trying to fit TTS into their language learning have found there are more fish in the sea than just those two!

1 Like

UPDATE :

  • Fixed the language loading bug when loading in CSV format.
  • Added 2 checkboxes to define whether to generate the audio file on the front or back side of the card.
1 Like

Sounds good to me actually. It has a weird vocal fry, but probably slightly better than Google TTS. It also does sound a bit more “human” probably because of the intonation.

(on a different note, I wonder how come we have awesome LLM coders nowadays and yet TTS voices suck so much…)

1 Like

I meant that the “minasan” and “konnichiwa” had too long pauses after them. I have done very little immersion so it might be just me.

1 Like

Here, all the Edge TTS voices are listed. Can you check if there are any better Japanese or Arabic voices on this list? You can listen to these voices on their website. If you found better ones, I’ll update my software to replace them. :slight_smile:

1 Like

The first time I tried that link, it had ONLY Spanish and English. The second time, it added several other languages, but did not include Japanese or Arabic. Nor any languages (other than Spanish/English) that I have enough experience with to judge. I didn’t think the Chinese sounded good, but I haven’t done enough with Chinese to trust my judgment. I did think the “Hong Kong English” sounded more like an American!

1 Like

Aha—there’s a trick to it. If you don’t see what you want on the “language” menu, go to the “Country” menu and click on whatever country is checked (ticked/selected) to unselect it. The Italian sounded pretty good. But the single example sentence is not adequate to really evaluate the quality.

1 Like

Apple has had some pretty good voices for years, but even those are noticeably fake even today. And some of their voices are terrible, but haven’t changed (as far as I can tell) in all those years.

1 Like

I’m actually thinking about making a similar tool who’ll be available directly from a website with a backend to generate audio files.

Also, with the possibility to have multiple choices of voices for a given language (I’m talking about all voices availables here : https://tts.travisvn.com/)

What do you think about this idea ?

1 Like