About

Extracts the vendor name and vendor type from bank transaction memos. Focused on vendors from the Bay Area, New York, and Boston. Made in collaboration with www.pilot.com.

F1-score of 89% | Trusted by product specialists | Unidirectional workflow

Final Design

Our Named Entity Recognition (NER) program combines a rule-based model with a machine learning model. Using the Gestalt pattern matching algorithm, we measure the similarity between the outputs of the two approaches and use it as a confidence score. If the similarity is greater than 60%, we give the user the output of the machine learning model. Otherwise, we give the user both outputs and flag the transaction to be double-checked manually. This increases the reliability of the automated categorization and simplifies the manual categorization process by supplying tentative recommendations.
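The comparison step can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes Python's standard `difflib.SequenceMatcher` (whose `ratio()` is based on Gestalt pattern matching) as the similarity measure, and the names `similarity`, `reconcile`, and `SIMILARITY_THRESHOLD`, the lowercasing step, and the shape of the returned dictionary are all hypothetical.

```python
from difflib import SequenceMatcher

# Assumed constant: the 60% cut-off described above, as a fraction.
SIMILARITY_THRESHOLD = 0.60


def similarity(a: str, b: str) -> float:
    """Return a Gestalt-pattern-matching similarity in [0, 1].

    Lowercasing and stripping whitespace is an assumption made for this
    sketch; the original project may normalize its inputs differently.
    """
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def reconcile(rule_based_output: str, ml_output: str,
              threshold: float = SIMILARITY_THRESHOLD) -> dict:
    """Compare the two model outputs and decide what to show the user."""
    score = similarity(rule_based_output, ml_output)
    if score > threshold:
        # The models agree closely enough: return the ML output only.
        return {
            "candidates": [ml_output],
            "confidence": score,
            "needs_review": False,
        }
    # The models disagree: return both outputs and flag for manual review.
    return {
        "candidates": [rule_based_output, ml_output],
        "confidence": score,
        "needs_review": True,
    }


if __name__ == "__main__":
    print(reconcile("Starbucks", "starbucks"))
    print(reconcile("Starbucks", "Uber"))
```

In this sketch, two outputs that differ only in case are treated as agreeing, while clearly different vendor names fall below the threshold and come back as two candidates with `needs_review` set.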

All training files, testing files, and any files that contain vendor information have been removed for privacy purposes.