Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Command to update case names #4619

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

quevon24
Copy link
Member

@quevon24 quevon24 commented Oct 25, 2024

This is the command to update the long resource case names

To run it:
manage.py update_casenames_csv --csv /opt/courtlistener/cl/assets/media/casenames_to_update.csv

This is the first round to correct the names. For this specific case I focused on processing the names where there are two matches, since this is the simplest, since we can identify them by splitting the name by "v." and it will only return two results.

The next step is to improve the names where we have more than two parties, this is the most complex since we do not have any label or similar that tells us where one ends and the next begins.

You can take a look at the file here:

https://drive.google.com/file/d/1V14_4zjx5zGjqAJepVT8kSxiKjYj3yYs/view?usp=sharing

The csv has this columns:

  • cluster_id: the cluster that we need to update
  • source: it could be only R or a merged source
  • case_name: empty because we don't have it in CL
  • case_name_full: the case name we have in CL
  • parties_text: the text extracted from the raw content
  • new_case_name: the suggested case name

@quevon24 quevon24 linked an issue Oct 25, 2024 that may be closed by this pull request
@quevon24 quevon24 requested a review from flooie October 25, 2024 14:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Super long case name's from resource.org need cleanup
1 participant