Skip to content

Commit

Permalink
πŸ“ Added data lakes stats
Browse files Browse the repository at this point in the history
  • Loading branch information
rcap107 committed Apr 22, 2024
1 parent eedebe8 commit 78fbfdc
Showing 1 changed file with 14 additions and 0 deletions.
14 changes: 14 additions & 0 deletions stats_data_lakes.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
column,binary_update,wordnet_vldb,wordnet_full,wordnet_vldb_3,wordnet_vldb_10,wordnet_vldb_50,open_data_us
n_tables,70.0,869.0,30072.0,3162.0,10059.0,47223.0,5591.0
tot_rows,20099403.0,8012927.0,671926357.0,73449573.0,241564466.0,1200891362.0,95743105.0
tot_cols,140.0,7122.0,95193.0,38553.0,126999.0,623685.0,133385.0
mean_n_cols,2.0,8.195627157652474,3.1655027932960893,12.19259962049336,12.625410080524903,13.207229527984245,23.857091754605616
median_n_cols,2.0,6.0,3.0,10.0,10.0,11.0,14.0
mean_n_rows,287134.32857142854,9220.859608745684,22343.91982575153,23228.833965844402,24014.75951883885,25430.221756347542,17124.50456090145
median_n_rows,40407.5,74.0,927.0,1602.0,1698.0,1767.0,1000.0
mean_num_attr,0.3,1.761795166858458,0.38969805799414736,3.5,3.5436922159260362,3.5942443300933866,11.097835807547845
median_num_attr,0.0,2.0,0.0,3.0,3.0,3.0,3.0
mean_cat_attr,1.7,6.433831990794016,2.775804735301942,8.69259962049336,9.081717864598867,9.612985197890858,12.759255947057772
median_cat_attr,2.0,4.0,3.0,6.0,6.0,7.0,7.0
mean_avg_null,3.6331067085512065e-6,0.44415456947931703,0.309895468216302,0.6035821354579941,0.620554933070128,0.643162857541306,0.09416792546407236
median_avg_null,0.0,0.49772727272727274,0.33162393162393167,0.6635476878856157,0.6652240681278794,0.6795489102889002,0.01098901098901099

0 comments on commit 78fbfdc

Please sign in to comment.