Releases

Version 0.0.9

Release date: Jul 1, 2024

  • Skip building language binaries from source

Version 0.0.8

Release date: Aug 17, 2023

  • Update codetext_cli format
  • Update PythonParser: handle class definitions with an empty argument list, e.g. class ABC()
  • Add support for JavaScript undeclared functions
  • Add PHP interface
  • Add Ruby actions with block parameters

Version 0.0.7

Release date: Jul 5, 2023

  • Update the output format of all class extractors (dict instead of list)
  • Fix missing identifier and parameter in the C, C#, and Java parsers
  • Implement CLI

Version 0.0.6

Release date: Jan 9, 2023

  • Add tree-sitter utils (in codetext.parser)
  • Replace all match_from_span with get_node_text
  • Replace all traverse_type with get_node_by_kind
  • Fix CppParser.get_function_metadata missing param_type and param_identifier
  • Update return metadata from all parsers

Version 0.0.5

Release date: Dec 12, 2022

  • Fix package import path
  • Add auto-build workflow
  • Separate the codetext parser from source-code processing
  • Fix remove_comment_delimiter remove leading whitespace
  • Update unit tests for parser and utilities

Version 0.0.4

Release date: Dec 2, 2022

  • Fix main package root path
  • Loosen docstring_parser dependency

Version 0.0.3

Release date: Dec 2, 2022

  • New clean docstring functions:
    • check_docstring_contain_question
    • check_docstring_underdevelopment
    • check_docstring_autogenerated
    • check_contain_little_single_char
    • check_contain_many_special_char
    • check_contain_little_unique_chars
    • check_contain_little_unique_words
    • check_contain_many_special_case
    • check_contain_too_many_variables
    • check_contain_many_repeated_word
    • check_contain_many_uppercase_word
    • check_contain_many_long_word

Version 0.0.2

Release date: Nov 25, 2022

  • Language parser for Rust
    • get_docstring
    • get_class_list, get_function_list
    • get_class_metadata, get_function_metadata
  • Processing utils:
    • extract_docstring
    • extract_node
    • get_line_definitions
    • get_node_definitions
    • process_raw_node
  • Postprocessing:
    • Merge file (from batches)
    • Split into train/test/valid (by #sample category)
    • Deduplicate sample

Version 0.0.1

Release date: Nov 9, 2022

  • Language parser for Java, Python, JavaScript, PHP, Golang, Ruby, C++, C#, C
    • get_docstring
    • get_class_list, get_function_list
    • get_class_metadata, get_function_metadata
  • Clean docstring function
  • Data preprocessing source code
  • Tree-sitter utils: build_language, parse_code