USLM to Plain Text #33

Open
iamwillbar opened this issue Jan 19, 2024 · 1 comment

@iamwillbar

Is the source code that converts USLM into plain text, as shown on the TXT view of congress.gov, open source? I've been unsuccessful in finding it.

@llaplant
Member

For enrolled bills, public laws, and the Statutes at Large on GovInfo, the USLM XML is currently produced during GovInfo processing by converting locator files into USLM XML. Locator files are an internal legacy format that GPO uses as part of the print production process. The TXT files on GovInfo and Congress.gov (the ones with hard returns at the end of each line) are also a by-product of the print production process: GPO uses a legacy program to convert locator files into TXT files prior to ingest into GovInfo.

Note that bills are available in TXT, PDF, and XML ("Bill-DTD XML") formats. Enrolled bills, public laws, and the Statutes at Large are also available in USLM XML.

GPO is in the process of replacing its legacy print composition system with an XML-based system and workflows. More information is available in GPO's XPub GitHub repository, along with sample responsive HTML files, which will be made available on GovInfo when XPub is deployed for bills. As XPub is rolled out, more bill versions and publications will be made available in responsive HTML and USLM XML formats.

https://github.com/usgpo/xpub
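
Since the published TXT files come from locator files rather than from the USLM XML, an approximation of the TXT view has to be built separately. The following is a minimal sketch in Python, not GPO's legacy program: it flattens the text nodes of an XML fragment and hard-wraps the result. The sample fragment, the `uslm_to_text` name, and the default width of 72 are all assumptions for illustration; real USLM files are namespaced and structured far more deeply, and the actual TXT layout (indentation, centering, line width) is not reproduced here.

```python
import textwrap
import xml.etree.ElementTree as ET

# Hypothetical, simplified fragment using USLM-style element names.
# Real USLM files are namespaced and far more deeply nested.
SAMPLE = """<section>
  <num>1.</num>
  <heading>Short title</heading>
  <content>This Act may be cited as the Example Act.</content>
</section>"""


def uslm_to_text(xml_string: str, width: int = 72) -> str:
    """Flatten XML to text and hard-wrap it at `width` columns."""
    root = ET.fromstring(xml_string)
    # itertext() yields every text node in document order, ignoring tags.
    raw = " ".join(root.itertext())
    # Collapse runs of whitespace left over from XML indentation.
    flat = " ".join(raw.split())
    # textwrap.fill breaks the text into lines of at most `width` characters.
    return textwrap.fill(flat, width=width)


if __name__ == "__main__":
    print(uslm_to_text(SAMPLE, width=40))
```

Because `itertext()` ignores tag names, the sketch works regardless of namespace, but it also discards all structure; a closer match to the TXT view would need per-element handling of headings, numbering, and indentation.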

@llaplant self-assigned this Jan 22, 2024