Duolingo Vocab Lists

This repository is a collection of Duolingo courses' vocabs.

Currently, the only course is Spanish (learned from English) in the "english-spanish" folder.

The list of words for a course can be downloaded in JSON and CSV format.

The main purpose of this project is to provide a means of getting the vocab in a format that is easy to import into a memorization tool. Tinycards does not seem to be maintained that well anymore (sadface).

Parsing other courses

Originially, I wrote this program to parse the words in the Spanish course from an awesome post in a Duolingo discussion. Thank you so much FieryCat!

However, now the program parses words from the website duome which has a very comprehensive list of the words for each course. Switched to this approach based on the detailed blog post by Melle Dijkstra.

To use the provided code:

Step 1: Clone it.

Run git clone [email protected]:jmbeach/duolingo-vocab-lists.git

Step 2: Install Dependencies / Build

Run yarn install.

Run yarn build.

Step 3: Get Skill Tree

Login to Duolingo.com. Scroll to the very bottom of the home page to make it load the entire course skill tree. Save the page as an HTML file. NOTE: you may have to clean the html file to ensure there is only one root note. For example: only Body as root.

Run node lib/index.js skilltree -f <path-to-html-file> to generate the skill tree JSON file. This is used to figure out what section each skill belongs to.

Step 4: Get Vocab List HTML

Go to https://duome.eu/<your-user-name>/progress. The skills tab contains an in-order list of all of the skills in your language. In the chrome developer console, run document.querySelectorAll('.click.skill') to expand every item on the page.

Once you've ran the querySelectorAll command, save the page to an HTML file. NOTE: you may have to clean the html file to ensure there is only one root note. For example: only Body as root.

NOTE: There might not be precise mappings between the skills found on the duome page and the duolingo page, so you might have to do some cleaning up (e.g. Adj 1 -> Adjective 1)

Step 5: Download Translations

Run node lib/index.js download -f <path-to-vocab-html-file> -s <path-to-skill-tree-json> [-a <google-api-key>] to download the translations to a JSON file.

The translator defaults to finding transaltions of words on Duolingo.com. However, if it can't find one, it uses Google Translate. To use google translate you'll have to get an API key and then put your API key into a .env file like this:

GOOGLE_TRANSLATE_API_KEY=<my-api-key>

NOTE: Make sure to change your desired language pair inside TranslationDownloader (it's es, en by default).

Step 6: Generate CSV Files

Finally, run node lib/index.js create -f <path-to-json-file> to turn the translations into CSV's.

If the new CSV's aren't in this repository yet, please feel free to create a pull request to add them. Currently, I've only processed Spanish (for English speakers), but would love to get other languages in here.

Step 7 (Optional): Create Combined CSV Files

It might be preferable for some people to have all of the CSV files for each section combined into one file. To generate these, run node lib/index.js combine -p <path to language directory>.

Name		Name	Last commit message	Last commit date
Latest commit History 110 Commits
.github/workflows		.github/workflows
.vscode		.vscode
english-spanish		english-spanish
greek-english		greek-english
src		src
.babelrc		.babelrc
.gitignore		.gitignore
README.md		README.md
duolingo-vocab-ex.gif		duolingo-vocab-ex.gif
package.json		package.json
tsconfig.json		tsconfig.json
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Duolingo Vocab Lists

Parsing other courses

Step 1: Clone it.

Step 2: Install Dependencies / Build

Step 3: Get Skill Tree

Step 4: Get Vocab List HTML

Step 5: Download Translations

Step 6: Generate CSV Files

Step 7 (Optional): Create Combined CSV Files

About

Releases

Packages

Languages

hueldoeu/duolingo-vocab-lists

Folders and files

Latest commit

History

Repository files navigation

Duolingo Vocab Lists

Parsing other courses

Step 1: Clone it.

Step 2: Install Dependencies / Build

Step 3: Get Skill Tree

Step 4: Get Vocab List HTML

Step 5: Download Translations

Step 6: Generate CSV Files

Step 7 (Optional): Create Combined CSV Files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages