heretic

mirror of https://github.com/p-e-w/heretic.git synced 2026-06-02 05:03:33 +02:00

Author	SHA1	Message	Date
Philipp Emanuel Weidmann	9b7624ddfa	build: bump version to 1.3.0 (tag: v1.3.0)	2026-05-05 18:22:02 +05:30
Philipp Emanuel Weidmann	0e7c14d94a	fix: minor cleanups and improvements	2026-05-04 22:11:14 +05:30
Philipp Emanuel Weidmann	02ce8ad079	chore: update dependencies	2026-05-03 19:25:36 +05:30
Philipp Emanuel Weidmann	79ea9ce905	docs: update README	2026-05-03 09:08:57 +05:30
Philipp Emanuel Weidmann	216c089974	Revert "Revert "fix: disable LoRA export for now" (#308)" (#319) This reverts commit `da92f745de`.	2026-05-03 07:25:00 +05:30
Philipp Emanuel Weidmann	43f8e86a84	fix: minor cleanups and improvements	2026-05-02 06:35:31 +05:30
anrp	da92f745de	Revert "fix: disable LoRA export for now" (#308) This reverts commit `025ab3a881`. Co-authored-by: Andrew Patrikalakis <anrp@tri.global>	2026-05-02 06:07:47 +05:30
dependabot[bot]	ebb5e651df	build(deps): bump mako from 1.3.10 to 1.3.11 (#309) Bumps [mako](https://github.com/sqlalchemy/mako) from 1.3.10 to 1.3.11. - [Release notes](https://github.com/sqlalchemy/mako/releases) - [Changelog](https://github.com/sqlalchemy/mako/blob/main/CHANGES) - [Commits](https://github.com/sqlalchemy/mako/commits) --- updated-dependencies: - dependency-name: mako dependency-version: 1.3.11 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-25 08:14:38 +05:30
Philipp Emanuel Weidmann	513e3acc72	fix: improve the reproducibility system (#303) * fix: various cleanups and improvements for the reproducibility system * fix: save only essential settings * fix: improve model commit handling * feat: make including system information optional * fix: improve formatting of reproducibility README * fix: fix remaining issues	2026-04-23 19:08:18 +05:30
dependabot[bot]	c4d6a62aad	build(deps): bump python-dotenv from 1.2.1 to 1.2.2 (#305) Bumps [python-dotenv](https://github.com/theskumar/python-dotenv) from 1.2.1 to 1.2.2. - [Release notes](https://github.com/theskumar/python-dotenv/releases) - [Changelog](https://github.com/theskumar/python-dotenv/blob/main/CHANGELOG.md) - [Commits](https://github.com/theskumar/python-dotenv/compare/v1.2.1...v1.2.2) --- updated-dependencies: - dependency-name: python-dotenv dependency-version: 1.2.2 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-23 12:40:50 +05:30
Olekssy	f654a43ac3	fix: prevent UnboundLocalError when analyzer is not initialized (#301) * fix: prevent UnboundLocalError when analyzer is not initialized Move cleanup of analyzer and residuals inside the conditional block where they are actually defined to avoid crashing when --print-residual-geometry or --plot-residuals are not used. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * fix: address AI review feedback on residual cleanup --------- Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-21 08:40:29 +05:30
Magic	ed5d8b9104	feat: add configurable residual processing to reduce peak VRAM usage (#239) * refactor residual memory optimizations * formatting * Fixed config.py positioning and default * fixed analyzier declaration in main.py * removing del statements * ruff * small updates * ty moveback ish	2026-04-18 16:46:22 +05:30
dependabot[bot]	5083fc0dd7	build(deps): bump pillow from 12.1.1 to 12.2.0 (#296) Bumps [pillow](https://github.com/python-pillow/Pillow) from 12.1.1 to 12.2.0. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/12.1.1...12.2.0) --- updated-dependencies: - dependency-name: pillow dependency-version: 12.2.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-14 19:07:47 +05:30
Darshan	cd422bbb99	fix: make --help return before heavy runtime imports (#293)	2026-04-12 16:33:30 +05:30
MoonRide303	e2c74bfb3c	fix: support for gemma 4 (#287)	2026-04-12 12:47:32 +05:30
Vinayyyy7	077e31f663	feat: reproducibility when saving & uploading a heretic model (#191 ) * feat: implement reproducibility features with safetensors * feat: prompt user before creating reproducibility folder * fix: use prompt_confirm wrapper * style comment * style comment * fix: ignore None values in Settings dump for TOML compatibility * fix: imports * feat: auto-generate seed if none provided for full reproducibility * style: fix ruff formatting issues * style: ruff * style: fix ty check errors with ty:ignore * Update src/heretic/main.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update src/heretic/utils.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * add period at end. Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Improve: Add README, checkpoint.jsonl, to Reproduce * fix: use centralize device info, remove random states file * feat: Add CUDA driver version * ruff * ruff... * ty fix * LGTM: Rich native strip, use nvidia-smi * ruff fix * ruff * revert kaggle hack) * normalize names for deduplication of packages/versions * docstring * rufff * cleanup, add suffix for torch CUDA version, distinguish ROCm * add PyTorch index URL detection * revert index URL to be simple * flip priority of index.. 
* add Important note * add exact suffix for WHL in instruction * add warning for heterogeneous GPU env * extend driver version info (more accelerators) * fix: style * sync * no abbreviation * use multi-line string * fix: prompt_confirm * feat: CPU info * strip 'slow' warning from environment.txt * feat: Add virtual env info to environment.txt * ruffff * feat: AMD (Radeon) GPU driver version * Refactor: system.py * feat: LGTM capturing specifc installation origin of heretic * feat: Include chosen trial into reproduce/README * style: run ruff format on utils.py * feat: reproduce.json * fix: seperate values in different keys * restore comment * style, clean, seperate commit key * no abbreviation, cleanup * remove labels, store only dependencies * missed import, ruff * sort import * feat: More CPU Info * only store direct dependencies of heretic * complete comment * refactor: use cpuinfo package instead * ruff import sort * distinguish cores & threads * move function amd-driver * rename * moving heretic package info, * rufff * Move: cleanup memory cache * fix: model.py import * no unknowns * generalize all accelerator info stuff * ruff f * move package info * type change * feat: no reproducibility suite for local saving/model used * import fix * fix: type check * style change * style ruff * feat: no env.txt, SHA256SUMS file, cleanup * feat: ADD tip to readme * remove trial index, two-keys only * fix: No time-zone * feat: No suite for local datasets allowed * simplify * featt: capture both direct and transitive dependencies * style: sort readme of reproducibility suite * feat: Store commit hash for datasets too * add total refusal prompts for evaluation display * remove try/except from cpu * extend SHA256 support * remove .txt * only have safetensors for SHA256 * style comment * use HF api to get commit hash * fix: requirements containing irrelevant dependencies * only store heretic-llm if from PyPI.. 
* add SELECTED tag to the trial that was pushed * AttributeError fix * simplify trial preservation * add direction_index in trial info * remove unwanted CPU info * style: rename --------- Co-authored-by: Vinayyyy7 <vinayumrethe99@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-04-11 19:15:19 +05:30
Arthur Wuhrmann	a1a1c30c58	fix: correct default value for max_memory. (#284) * fix: correct default value for max_memory. The other does not compile. * fix: update syntax for default value of max_memory	2026-04-08 18:47:41 +05:30
Philipp Emanuel Weidmann	b08a0925c1	feat: make response prefix logic configurable	2026-04-07 13:24:48 +05:30
Philipp Emanuel Weidmann	f612a48b9f	build: prevent installing dependency packages published in the past 7 days	2026-04-04 08:54:37 +05:30
dependabot[bot]	117e3b73ac	build(deps): bump urllib3 from 2.6.1 to 2.6.3 (#273) Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.6.1 to 2.6.3. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/2.6.1...2.6.3) --- updated-dependencies: - dependency-name: urllib3 dependency-version: 2.6.3 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:25:54 +05:30
dependabot[bot]	5f6e1e4d52	build(deps): bump requests from 2.32.5 to 2.33.0 (#272) Bumps [requests](https://github.com/psf/requests) from 2.32.5 to 2.33.0. - [Release notes](https://github.com/psf/requests/releases) - [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md) - [Commits](https://github.com/psf/requests/compare/v2.32.5...v2.33.0) --- updated-dependencies: - dependency-name: requests dependency-version: 2.33.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:25:25 +05:30
dependabot[bot]	7ebd92dfa7	build(deps): bump pygments from 2.19.2 to 2.20.0 (#271) Bumps [pygments](https://github.com/pygments/pygments) from 2.19.2 to 2.20.0. - [Release notes](https://github.com/pygments/pygments/releases) - [Changelog](https://github.com/pygments/pygments/blob/master/CHANGES) - [Commits](https://github.com/pygments/pygments/compare/2.19.2...2.20.0) --- updated-dependencies: - dependency-name: pygments dependency-version: 2.20.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:24:56 +05:30
dependabot[bot]	655d66ef24	build(deps): bump nltk from 3.9.3 to 3.9.4 (#270) Bumps [nltk](https://github.com/nltk/nltk) from 3.9.3 to 3.9.4. - [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog) - [Commits](https://github.com/nltk/nltk/compare/3.9.3...3.9.4) --- updated-dependencies: - dependency-name: nltk dependency-version: 3.9.4 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:24:29 +05:30
dependabot[bot]	0f99c882ec	build(deps): bump filelock from 3.20.0 to 3.20.3 (#269) Bumps [filelock](https://github.com/tox-dev/py-filelock) from 3.20.0 to 3.20.3. - [Release notes](https://github.com/tox-dev/py-filelock/releases) - [Changelog](https://github.com/tox-dev/filelock/blob/main/docs/changelog.rst) - [Commits](https://github.com/tox-dev/py-filelock/compare/3.20.0...3.20.3) --- updated-dependencies: - dependency-name: filelock dependency-version: 3.20.3 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:23:59 +05:30
dependabot[bot]	92f851b693	build(deps): bump pillow from 12.0.0 to 12.1.1 (#268) Bumps [pillow](https://github.com/python-pillow/Pillow) from 12.0.0 to 12.1.1. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/12.0.0...12.1.1) --- updated-dependencies: - dependency-name: pillow dependency-version: 12.1.1 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:23:32 +05:30
dependabot[bot]	81e0c84ec6	build(deps): bump aiohttp from 3.13.2 to 3.13.4 (#267) --- updated-dependencies: - dependency-name: aiohttp dependency-version: 3.13.4 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:10:51 +05:30
Philipp Emanuel Weidmann	887d43a8d9	fix: set batch size on HFLM object	2026-04-01 14:27:43 +05:30
Philipp Emanuel Weidmann	96c7a7d98a	fix: replace tqdm progress bars with Rich progress bars	2026-03-28 18:30:15 +05:30
Philipp Emanuel Weidmann	1126332281	feat: add integrated benchmarking system	2026-03-24 18:25:12 +05:30
Philipp Emanuel Weidmann	19cdf7e244	fix: address ty complaint	2026-03-15 09:58:00 +05:30
Philipp Emanuel Weidmann	94775d4148	chore: update dependencies	2026-03-15 09:31:32 +05:30
cpagac	515a7b9eb5	fix: prevent div-by-zero in evaluator when base_refusals is 0 (#225) * fix: prevent div-by-zero in evaluator when base_refusals is 0 When a model refuses all prompts from the start, base_refusals is 0. Return refusals directly in that case so ablations that introduce new refusals are still penalized correctly. * fix: cast refusals to float for type consistency" before hitting commit changes Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-03-13 11:21:23 +05:30
erm14254	e26da5e0e6	fix: display all abliterable components across layers (#215) * fix: display all abliterable components across layers The current code only displays abliterable components from layer 0, which is misleading for hybrid architectures like Qwen3.5 that use different attention types across layers (e.g., `linear_attn.out_proj` in some layers, `self_attn.o_proj` in others). This fix iterates through all layers to collect and display the complete set of abliterable components with accurate module counts. Before (Qwen3.5-27B): * attn.out_proj: 1 modules per layer * mlp.down_proj: 1 modules per layer After (Qwen3.5-27B): * attn.out_proj: 48 modules total * attn.o_proj: 16 modules total * mlp.down_proj: 64 modules total * Fix formatting --------- Co-authored-by: Lawfer12 <ac728@ymail.com>	2026-03-11 14:10:37 +05:30
Philipp Emanuel Weidmann	ec0367226d	style: fix formatting and naming	2026-03-06 13:18:08 +05:30
Matthias Stegner	5e3c04c802	feat: add Qwen3.5 MoE hybrid layer support (#187) * feat: add Qwen3.5 MoE hybrid layer support Qwen3.5 MoE uses GatedDeltaNet (linear attention) on some layers instead of standard self-attention, causing abliteration to fail because self_attn.o_proj doesn't exist on those layers. Changes: - Wrap self_attn.o_proj in suppress(Exception) and add linear_attn.out_proj as alternative attention out-projection for GatedDeltaNet layers - Scan all layers in get_abliterable_components() instead of only layer 0, since hybrid models have different components on different layers - Derive LoRA target_modules from actual named_modules() instead of splitting component keys, which fails when module names differ across layers (e.g. "o_proj" vs "out_proj") Tested with Qwen3.5-397B-A17B (7/100 refusals, KL 0.2676). Relates to #43 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Apply suggestion from @gemini-code-assist[bot] Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Philipp Emanuel Weidmann <pew@worldwidemann.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-03-06 13:03:57 +05:30
Spiky Moth	303ba9d978	fix: recheck prefix after inserting predefined (#194)	2026-02-27 08:07:33 +05:30
Philipp Emanuel Weidmann	cb4ef3fdfc	docs: add Trendshift badge to README	2026-02-20 13:00:19 +05:30
cpagac	4c80c4beb9	fix: report VRAM usage across all GPUs instead of only the default device (#169) memory_allocated() and memory_reserved() without a device argument only report GPU 0. Sum across all devices for correct multi-GPU totals and add total VRAM reporting.	2026-02-17 12:53:41 +05:30
Spiky Moth	3a115e280c	fix: produce card for local models with existing readme (#157)	2026-02-15 19:10:10 +05:30
Philipp Emanuel Weidmann	27097bfe8e	build: bump version to 1.2.0 (tag: v1.2.0)	2026-02-14 18:11:42 +05:30
Philipp Emanuel Weidmann	025ab3a881	fix: disable LoRA export for now Workaround for #152	2026-02-14 16:56:12 +05:30
Philipp Emanuel Weidmann	1179013999	docs: update README	2026-02-14 16:32:08 +05:30
Philipp Emanuel Weidmann	fe7bc1bae3	docs: update README	2026-02-14 10:47:28 +05:30
Philipp Emanuel Weidmann	e70a1a85e8	fix: don't load checkpoint when evaluating a second model Fixes #144	2026-02-14 10:02:17 +05:30
Philipp Emanuel Weidmann	e7f8be98b7	fix: only export tokenizer when exporting full model Fixes #143	2026-02-14 09:18:22 +05:30
Philipp Emanuel Weidmann	6017bcd347	fix: use compatible release specifiers for non-dev dependencies Fixes #145 Credit to MuX on Discord for recognizing that this is an issue with Transformers 5	2026-02-13 12:27:57 +05:30
Philipp Emanuel Weidmann	dd0b3a2f69	docs: update README	2026-02-11 11:09:17 +05:30
Philipp Emanuel Weidmann	b873598b77	docs: improve settings documentation	2026-02-11 10:19:05 +05:30
Philipp Emanuel Weidmann	10ceb3098e	chore: update copyright notice	2026-02-11 09:46:36 +05:30
Salman Chishti	745b582414	ci: upgrade GitHub Actions to latest versions (#137) Signed-off-by: Salman Muin Kayser Chishti <13schishti@gmail.com>	2026-02-08 16:49:04 +05:30
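
Note on commit 515a7b9eb5: the zero-baseline guard its message describes can be sketched as below. This is a minimal illustration and not the code from the repository. The function name `refusal_score` and its signature are hypothetical; only the behaviour (return the refusal count directly, cast to float, when `base_refusals` is 0) is taken from the commit message.

```python
def refusal_score(refusals: int, base_refusals: int) -> float:
    """Hypothetical helper: refusals relative to the baseline refusal count.

    When the baseline is 0, dividing would raise ZeroDivisionError, so the
    raw refusal count is returned instead (as a float, for a consistent
    return type). Any refusals newly introduced by an ablation then still
    yield a positive score, i.e. they are still penalized.
    """
    if base_refusals == 0:
        return float(refusals)
    return refusals / base_refusals


if __name__ == "__main__":
    print(refusal_score(5, 10))  # ordinary case: ratio to the baseline
    print(refusal_score(3, 0))   # zero baseline: raw count, no exception
```

Returning the raw count keeps the score monotonic in `refusals` when no baseline ratio exists, which is the property the commit message asks for.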


151 Commits