Commit

Adding new dict
AG committed Mar 5, 2024
1 parent 57308d5 commit 0988855
Showing 2 changed files with 4 additions and 3 deletions.
1 change: 1 addition & 0 deletions dictionary-33.7MB.js

Large diffs are not rendered by default.

6 changes: 3 additions & 3 deletions train.py
@@ -13,9 +13,9 @@
# RECIPES #
###########
# All with chunk size of 1024
-# ?.?MB: Target dictionary count 100,000, Prune 1,000,000
-# 8.5MB: Target dictionary count 25,000, Prune 10,000,000
-# 5.4MB: Target dictionary count 10,000, Prune 10,000,000
+# 33.7MB: Target dictionary count 100,000, Prune 1,000,000
+# 8.5MB: Target dictionary count 25,000, Prune 10,000,000
+# 5.4MB: Target dictionary count 10,000, Prune 10,000,000

PRUNE_FREQUENCY = 1000 * 1000 # Every this many document positions
CHUNK_SIZE = 1024 # 1KB per chunk
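
The recipe comments above could also be kept as data, so that a dictionary size can be looked up by label instead of being read out of a comment block. The following is only a sketch under assumptions: the `Recipe` class, the `RECIPES` list, and the `recipe_for` helper are hypothetical names that do not appear in train.py, and treating the "Prune" column as the value assigned to `PRUNE_FREQUENCY` is an inference from the matching 1,000,000 figure, not something the commit states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Recipe:
    """One row of the RECIPES comment block (hypothetical structure)."""
    label: str                    # resulting dictionary size, e.g. "33.7MB"
    target_dictionary_count: int  # "Target dictionary count" column
    prune: int                    # "Prune" column (assumed to feed PRUNE_FREQUENCY)
    chunk_size: int = 1024        # "All with chunk size of 1024"


# Values copied from the recipe comments in the diff.
RECIPES = [
    Recipe("33.7MB", 100_000, 1_000_000),
    Recipe("8.5MB", 25_000, 10_000_000),
    Recipe("5.4MB", 10_000, 10_000_000),
]


def recipe_for(label: str) -> Recipe:
    """Return the recipe with the given size label, or raise KeyError."""
    for recipe in RECIPES:
        if recipe.label == label:
            return recipe
    raise KeyError(f"no recipe labelled {label!r}")


if __name__ == "__main__":
    chosen = recipe_for("33.7MB")
    print(chosen.target_dictionary_count, chosen.prune, chosen.chunk_size)
```

With this layout, the 33.7MB entry carries the same prune value as the `PRUNE_FREQUENCY = 1000 * 1000` constant in the diff, which is why that recipe appears to be the one currently configured.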
