mirror of
https://github.com/albertan017/LLM4Decompile.git
synced 2026-06-17 01:55:50 +00:00
Integrate BringUpBench evaluation into sk2decompile/evaluation/bringupbench/, corresponding to Section A.6 of the paper (arXiv:2509.22114). BringUpBench is a benchmark suite of 90 self-contained C programs (505 functions, O0-O3). SK2Decompile achieves 42.3% compilation rate and 27.0% re-executability rate, compared to IDA Pro's 23.6% / 21.7%. Contents: - scripts/: 5-step reproduction pipeline (compile, decompile, map, infer, eval) - data/func_maps/: pre-built function-level mappings (source <-> pseudo <-> asm) - data/infer_results/: SK2Decompile inference outputs for all opt levels - reports/: per-opt-level evaluation result summaries (Markdown) - config.env: template environment configuration - README.md: comprehensive documentation with reproduction guide Also updated sk2decompile/README.md to reference BringUpBench evaluation. |
||
|---|---|---|
| .. | ||
| bringupbench | ||
| data | ||
| metrics | ||
| evaluate_exe.py | ||
| evaluate_r2i.py | ||
| gpt_judge.py | ||
| inf_type.py | ||
| llm_server.py | ||
| normalize_pseudo.py | ||
| sk2decompile_inf.py | ||
| stack.yaml | ||
| template.txt | ||