Skip to content

Python project to process a weakly structured docx file and produce a structured, 508 compliant docx file, a txt version, and a machine readable version.

License

Notifications You must be signed in to change notification settings

measiala/PUMS_Data_Dictionary

Repository files navigation

PUMS_Data_Dictionary

This is an update to the previous project PUMS_Data_Dictionary. It greatly improves both the object-oriented approach and couples it with more robust testing using py.test. This will be the finished version to hand off to others.

To-Do List

[x] Complete the handling of record types and the machine readable file.

[x] Complete final write out of files including testing.

[x] Obtain final layout of input file to include variable type

[x] Add tailored log output

[ ] Obtain final layout of machine readable file (improved)

[x] Neaten up test cases

[ ] Remove unused code

Uses the following modules:

argparse os re logging textwrap docx* py.test*

*Not installed by default

About

Python project to process a weakly structured docx file and produce a structured, 508 compliant docx file, a txt version, and a machine readable version.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages