Release date: Jul 1, 2024
- Skip building language binaries from source
Release date: Aug 17, 2023
- Update format codetext_cli
- Update PythonParser: Handle class definitions with empty argument list class ABC()
- Add Javascript undeclared functions
- Add PHP interface
- Add Ruby actions with block parameters
Release date: Jul 5, 2023
- Update all class extractor format (using dict instead of list)
- Fix missing identifier, parameter in C, C#, Java parser
- Implement CLI
Release date: Jan 9, 2023
- Add tree sitter utils (in codetext.parser)
- Replace all
match_from_span
toget_node_text
- Replace all
traverse_type
toget_node_by_kind
- Fix
CppParser.get_function_metadata
missingparam_type
andparam_identifier
- Update return metadata from all parser
Release date: Dec 12, 2022
- Fix package import path
- Adding auto build workflow
- Seperate codetext parser with processing source code
- Fix
remove_comment_delimiter
remove leading whitespace - Update unittest for parser and utilites
Release date: Dec 2, 2022
- Fix main package root path
- Loosen
docstring_parser
dependency
Release date: Dec 2, 2022
- New clean docstring function
- check_docstring_contain_question
- check_docstring_underdevelopment
- check_docstring_autogenerated
- check_contain_little_single_char
- check_contain_many_special_char
- check_contain_little_unique_chars
- check_contain_little_unique_words
- check_contain_many_special_case
- check_contain_too_many_variables
- check_contain_many_repeated_word
- check_contain_many_uppercase_word
- check_contain_many_long_word
Release date: Nov 25, 2022
- Language parser for Rust
- get_docstring
- get_class_list, get_function_list
- get_class_metadata, get_function_metadata
- Processing utils:
- extract_docstring
- extract_node
- get_line_definitions
- get_node_definitions
- process_raw_node
- Postprocessing:
- Merge file (from batches)
- Split into train/test/valid (by #sample category)
- Deduplicate sample
Release date: Nov 9, 2022
- Language parser for Java, Python, JavaScript, PHP, Golang, Ruby, C++, C#, C
- get_docstring
- get_class_list, get_function_list
- get_class_metadata, get_function_metadata
- Clean docstring function
- Data preprocessing source code
- Tree-sitter utils: build_language, parse_code