This is a python script for automatically generating an Academic Word list. Reads XML or text documents in any language. Preliminary tests on Norwegian.
The functions do not create stems or lemmas. This must be done additionally for each language considered.