- Fix a hang due to incorrect jump-table boundaries inferred from irrelevant register correlations to the index register
- Requires gtirb >=2.2.0
- Improved code inference:
- Do not miss code after literal pools.
- Switch decode mode if invalid instruction found in ARM.
- Fixed bug in pointers to string data blocks.
- Restrict padding blocks so they do not share instructions with code blocks.
- Start a new block if we transition from padding to not padding or from not padding to padding.
- Change the type of several heuristics from "simple" to "proportional"
- Additional heuristic: Simple string literals in literal pools
- Additional heuristic: Function beginning pattern with push/adjust-sp as plausible instruction sequence
- Fix bug that led to string data blocks potentially overlapping code blocks.
- Fix bug that resulted in integral symbols on ISAs other than x64 (ARM and x86).
- Fix symbolization bug of ADR instructions in ARM32 that refer to code.
- Fix bug in PE code inference that could lead to the whole .text section being declared invalid if a data directory was attached to the end of the section.
- Add alignments to data blocks that require alignment even within data sections
- Fix 16-Thumb STM instructions considered to be invalid if the same register is used in reglist and register operands with writeback enabled.
- Fixed bug that could result in missed symbolic expressions with TLS variables for
local-executable
TLS model - Fix bug that caused assembling error due to wrong
symbol_minus_symbol
for lsda entries with references to the end of.gcc_except_table
- Stop generating debian metapackages and packages with the version attached
to the package name. Updates in the apt-repository now support multiple
package versions and upgrading
ddisasm
withapt-get upgrade
. - Improve def-use and value-reg stack analysis to consider push and pop instructions. These changes also fix a couple of bugs in the stack variable propagation.
- Update LIEF to 0.13.2
- No longer consider
_x86.get_pc_thunk*
functions as ABI-intrinsic; this means_copy
is not appended to the original symbol, and a symbol forwarding entry is not created. - Fix handling of BLR instruction in ARM64.
- Fix size access of LDR instruction in ARM64.
- Extend value_reg analysis to support memory loads using a register with constant address.
- Refactor the code inference point system. Decouple heuristics from their weights. Heuristic weights can now be modified by providing user hints.
- Generate GOT, PAGE and GOT, OFST symbolic expression attributes for split .got loads on MIPS.
- Correct symbol_minus_symbol in lsda entries with a reference to the end of
.gcc_except_table
: addboundary_sym_expr
for such reference - Add
ElfSoname
aux-data forSONAME
dynamic-section entry - Requires gtirb >=2.1.0
- Track values of registers R8B - R15B on x86-64, which are in some cases needed for inferring jump table boundaries.
- Infer jump table boundaries from comparisons of registers correlated to the index register.
- Relax constraints for inferring jump table boundaries from comparisons of indirect operands
- Fix bug where a relative jump table starting with consecutive zero offsets was truncated at the first non-zero value.
- Add alignment for x86-64 instructions that require explicitly aligned memory (e.g., some SIMD instructions)
- Update capstone version from 4.0.1 to 5.0.1
- Avoid generating
_start
symbol when the entry-point address is not a code block.
- Prefer LOCAL symbols over GLOBAL ones when selecting symbols for symbolic expressions for ISAs other than MIPS.
- Support GTIRB sections with holes (byte intervals only covering part of the section).
- Use pre-existing code blocks as hints when disassembling a RAW binary.
- Better data access computation for MIPS binaries.
- Detect incremental linking regions in PE binaries.
- Create elfStackSize and elfStackExec auxdata from ELF PT_GNU_STACK segments.
- In PE binaries, every exported code symbol is considered a function entry.
- Fixed bug where
elfSymbolTabIdxInfo
aux data could refer to non-existent UUIDs. - Fixed unrecognized
tls_get_addr
pattern that could result in missed symbolic expressions. - Binaries with zero-sized
OBJECT
symbols no longer produce missing code blocks. $t
symbols in ARM binaries now force creation of Thumb-mode code blocks.- In PE binaries, duplicate imports no longer create duplicate symbols.
- Added pattern to match missed symbolic data in pointer arrays.
- Fix symbols associated to functions (Auxdata functionNames) for PE binaries
when Ddisasm is run with option
-F
. - Requires gtirb >=2.0.0, gtirb-pprinter >=2.0.0
- Update code inference to use weighted interval scheduling to resolve blocks; this improves code inference results, especially on ARM.
- ARM: Discover unreferenced code blocks occurring after literal pools.
- Refactored CFG inference. It now infers more kinds of indirect calls and and branches using value analysis, data accesses, and relocations.
- ELF: Infer
SHARED
orPIE
forDYN
binary type - ELF: Generate
elfDynamicInit
andelfDynamicFini
auxdata
- ARM: Improve code inference using unwind information from .ARM.exidx section
- Replace symbolic expression attributes with composable labels.
- ddisasm output now displays runtimes of "load", "compute", and "transform" phases of each analysis pass
- Add known_block and impossible_block passes to code inference.
- Various ARM32 code inference improvements.
- Various command-line options now apply to all datalog passes:
--debug-dir
creates subdirectories for each datalog pass--interpreter
argument now specifies the source code repository's root directory, defaults to the current working directory, and enables the interpreter for all datalog passes--profile
specifies a directory name where profiles for each datalog pass is written- Entries in files provided to
--hints
should namespace relation names using the name of the analysis pass, e.g.,disassembly.invalid
. --with-souffle-relations
keeps relations from all passes; entries in thesouffleFacts
andsouffleOutputs
auxdata are now namespaced with the name of the analysis pass, e.g.,disassembly.block_points
.
- Add support for x86-32 dynamic TLS.
- Improve IFUNC symbolic expression symbol selection.
- Several refactorings towards spliting code inference and symbolization.
- Refactor ELF symbol reading.
- Add "overlay" AuxData table.
- Update Souffle to version 2.4.
- Add cmake option
DDISASM_GENERATE_MANY
to use Souffle's--generate-many
code generation option; this should yield much faster incremental build times for ddisasm. - Utilize Souffle's feature to prune intermediate relations to reduce ddisasm's peak memory usage; processing large binaries can use up to ~20% less memory.
- Update LIEF to 0.13.0.
- Add Linux Python package for ddisasm.
- Discover ARM Thumb blocks at section start in stripped binaries.
- Bugfix for undefined TLS symbol relocations.
- Add ARM
HLT
andTRAP
instruction support. - Bugfix for aliased copy-relocations.
- Bugfix to avoid propagating synchronous accesses across symbols.
- Add build option for compiled Souffle profile generation.
- Improved x86-64 dynamic TLS support.
- Various logic improvements for GLIBC rewriting support.
- Update generated
elfSymbolVersions
auxdata. - Add "--ignore-errors" argument.
- Update LIEF to version 0.12.3.
- Add PE32 Safe Structured Exception Handling (SAFESEH) support.
- Update LIEF to version 0.12.1.
- Update Souffle to version 2.3.
- Ubuntu 18 and gcc7 are no longer supported
- ARM64 support improvements:
- Handle single-byte jump tables where the value should be interpreted as a signed difference.
- Prevent generation of incorrect :lo12: attributes in post-index operands.
- Add "--hints" argument.
- Rename
elfSectionIndex
tosectionIndex
, andelfSectionProperties
tosectionProperties
, and removepeSectionProperties
. - Refactor
section_complete
: addsection_property
, and renamesection_complete
tosection
, andsection
toloaded_section
. - Ensure ddisasm --version always reports a commit hash.
- Improve handling of unnamed relocations for libraries and object files.
- Improvements to jump table detection.
- ARM64 support improvements:
- Support symbolization of split loads separated by jump tables or temporary stack storage.
- Fix disassembling tbz and tbnz branch instructions.
- Fix value tracking of special registers (FP, LR, SP).
- Eliminate warnings generated by SYS instructions.
- Fix disassembling stripped ARM64 binaries.
- Release support for ARM32 and MIPS32 binaries.
- Optimizations to improve performance and reduce memory usage.
- Require GrammaTech's Capstone fork, >5.0.0.
- Add "--profile" argument.
- Add ELF x86_64 .o support.
- Add static archive support.
- Fix a bug with ELF file parsing that could generate failed "unknown binding in elfSymbolInfo" assertions.
- Infer "main" function in ARM64 binaries.
- Use dedicated symbolic expression attributes.
- Support Souffle interpreter for development.
- Add support for loading existing GTIRB files.
- Add PE support.
- Support ELF x64 static binaries
- Add preliminary x86-32 support.
- Populate GTIRB symbolic expression attributes.
- Update to souffle 2.0.2.
- Populate all allocated ELF sections.
- Register value analysis can track values through the stack.
- Refactor
gtirb-decoder
for more versatile loading of GTIRB.
- Restructure gtirb-to-datalog into
gtirb-decoder
component. - Move
ddisasm_main.cpp
toMain.cpp
. - Move
GtirbModuleDisassembler.cpp
toDisassembler.cpp
.
- Added preliminary ARM64 support. Contributed by the Programming Language Group, University of Sydney.
- Populate libraries and libraryPaths aux data tables.
- Allow using '-' as an output alias for stdout.
- Refactor binary object parsing into gtirb-builder.
- Fix performance bug for large data sections.
- Aggressive propagation for relative jump tables.
- Use defs through indirect jumps.
- Create data blocks spanning multiple bytes.
- Added --version flag, initial version is 1.0.0