Skip to content

Utilized the Csvkit library to explore and clean CSV files from the command line

Notifications You must be signed in to change notification settings

syed0019/Data_Cleaning_and_Exploration_Using_Csvkit

Repository files navigation

Data_Cleaning_and_Exploration_Using_Csvkit

In this project, we utitlized the Csvkit library, which supercharges your workflow by adding 13 new command line tools specifically for working with CSV files. We focused on below 5 tools from Csvkit:

  • csvstack: for stacking rows from multiple CSV files.
  • csvlook: renders CSV in pretty table format.
  • csvcut: for selecting specific columns from a CSV file.
  • csvstat: for calculating descriptive statistics for some or all columns.
  • csvgrep: for filtering tabular data using specific criteria.

We used csvkit version 0.9.1 in this mission and you can read about the installation procedure in the documentation. Furthermore, we worked with the 3 datasets on housing affordability which may be downloaded from below:

  • Hud_2005.csv ===> here.
  • Hud_2007.csv ===> here.
  • Hud_2013.csv ===> here.

About

Utilized the Csvkit library to explore and clean CSV files from the command line

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages