heretic

mirror of https://github.com/p-e-w/heretic.git synced 2026-06-01 20:58:47 +02:00

Author	SHA1	Message	Date
UnstableLlama	b79aa717c6	feat: add config.nohumor.toml (#340 ) * feat: add config.nohumor * Update config.nohumor.toml Following style guide Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update config.nohumor.toml Reduced initial comments --------- Co-authored-by: UnstableLlama <randomnotrealemail@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-05-31 15:26:40 +05:30
Rocker Zhang	db07814a97	build(deps): remove unused hf-transfer dependency (#338 ) hf-transfer is declared in pyproject.toml but never activated: nothing in the codebase sets HF_HUB_ENABLE_HF_TRANSFER, and downloads go through from_pretrained / hf_hub_download with no transfer toggle. huggingface-hub is pinned ~=1.7, where Xet is the default transfer backend, so hf-transfer is dead weight and only surfaces a deprecation warning.	2026-05-31 15:16:31 +05:30
Rocker Zhang	b790094193	feat: support plain text files as prompt datasets (#337 ) A dataset path that points to a plain file is now read as one prompt per line, with empty lines ignored. For text files, "column" is ignored and "split" is optional; when given, it selects a subset of lines using slice notation (e.g. "[:400]"). Detection uses os.path.isfile so files without an extension also work. The split-parsing logic is factored into a shared get_split_slice helper, which derives the split name from the specification, and split/column are now optional in DatasetSpecification, with the dataset branches raising a clear error when either is missing. An invalid split raises instead of being silently ignored. A bare slice does not parse with the pinned datasets version, since ReadInstruction.from_spec expects a named split, so the text branch prepends a synthetic split name. Revives the approach from #103. Closes #98. Co-authored-by: Ric <ricyoung@gmail.com>	2026-05-31 15:06:47 +05:30
kabachuha	6338e2c99b	feat: add "disclaimer" to the prohibited strings list (#334 ) * add "disclaimer" to the prohibited strings list The favorite Gemma's word. * add "disclaimer" to config.py refusal markers	2026-05-28 17:36:30 +05:30
dependabot[bot]	4dcacb5eba	build(deps): bump urllib3 from 2.6.3 to 2.7.0 (#328 ) Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.6.3 to 2.7.0. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/2.6.3...2.7.0) --- updated-dependencies: - dependency-name: urllib3 dependency-version: 2.7.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-22 15:00:08 +05:30
dependabot[bot]	b8d2c5a7e9	build(deps): bump idna from 3.11 to 3.15 (#327 ) Bumps [idna](https://github.com/kjd/idna) from 3.11 to 3.15. - [Release notes](https://github.com/kjd/idna/releases) - [Changelog](https://github.com/kjd/idna/blob/master/HISTORY.md) - [Commits](https://github.com/kjd/idna/compare/v3.11...v3.15) --- updated-dependencies: - dependency-name: idna dependency-version: '3.15' dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-22 14:56:21 +05:30
Philipp Emanuel Weidmann	4e3a3a78a3	docs: update README	2026-05-22 14:51:24 +05:30
iuyua9	551db26bb7	fix: recognize root Hugging Face repo IDs (#325 ) * fix: recognize root Hugging Face repo IDs * fix: propagate invalid HF repo ids * fix: match transformers local path precedence	2026-05-16 09:19:15 +05:30
dependabot[bot]	8b5b85bec9	build(deps): bump mako from 1.3.11 to 1.3.12 (#323 ) Bumps [mako](https://github.com/sqlalchemy/mako) from 1.3.11 to 1.3.12. - [Release notes](https://github.com/sqlalchemy/mako/releases) - [Changelog](https://github.com/sqlalchemy/mako/blob/main/CHANGES) - [Commits](https://github.com/sqlalchemy/mako/commits) --- updated-dependencies: - dependency-name: mako dependency-version: 1.3.12 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-09 15:19:28 +05:30
anrp	1b4851536d	fix: Reset model after saving merged model (#321 ) * fix: Reset model after saving merged model The adapter is lost and writes 0-byte adapters if you save an adapter after saving the merged model. * Revert "Revert "Revert "fix: disable LoRA export for now" (#308)" (#319)" This reverts commit `216c089974`. * Add comment as to why resetting model is needed	2026-05-09 15:16:26 +05:30
Philipp Emanuel Weidmann	b2bdc1f9d6	feat: add functionality for collecting reproduce.json files from Hugging Face	2026-05-07 18:33:50 +05:30
Philipp Emanuel Weidmann	9b7624ddfa	build: bump version to 1.3.0 v1.3.0	2026-05-05 18:22:02 +05:30
Philipp Emanuel Weidmann	0e7c14d94a	fix: minor cleanups and improvements	2026-05-04 22:11:14 +05:30
Philipp Emanuel Weidmann	02ce8ad079	chore: update dependencies	2026-05-03 19:25:36 +05:30
Philipp Emanuel Weidmann	79ea9ce905	docs: update README	2026-05-03 09:08:57 +05:30
Philipp Emanuel Weidmann	216c089974	Revert "Revert "fix: disable LoRA export for now" (#308 )" (#319 ) This reverts commit `da92f745de`.	2026-05-03 07:25:00 +05:30
Philipp Emanuel Weidmann	43f8e86a84	fix: minor cleanups and improvements	2026-05-02 06:35:31 +05:30
anrp	da92f745de	Revert "fix: disable LoRA export for now" (#308 ) This reverts commit `025ab3a881`. Co-authored-by: Andrew Patrikalakis <anrp@tri.global>	2026-05-02 06:07:47 +05:30
dependabot[bot]	ebb5e651df	build(deps): bump mako from 1.3.10 to 1.3.11 (#309 ) Bumps [mako](https://github.com/sqlalchemy/mako) from 1.3.10 to 1.3.11. - [Release notes](https://github.com/sqlalchemy/mako/releases) - [Changelog](https://github.com/sqlalchemy/mako/blob/main/CHANGES) - [Commits](https://github.com/sqlalchemy/mako/commits) --- updated-dependencies: - dependency-name: mako dependency-version: 1.3.11 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-25 08:14:38 +05:30
Philipp Emanuel Weidmann	513e3acc72	fix: improve the reproducibility system (#303 ) * fix: various cleanups and improvements for the reproducibility system * fix: save only essential settings * fix: improve model commit handling * feat: make including system information optional * fix: improve formatting of reproducibility README * fix: fix remaining issues	2026-04-23 19:08:18 +05:30
dependabot[bot]	c4d6a62aad	build(deps): bump python-dotenv from 1.2.1 to 1.2.2 (#305 ) Bumps [python-dotenv](https://github.com/theskumar/python-dotenv) from 1.2.1 to 1.2.2. - [Release notes](https://github.com/theskumar/python-dotenv/releases) - [Changelog](https://github.com/theskumar/python-dotenv/blob/main/CHANGELOG.md) - [Commits](https://github.com/theskumar/python-dotenv/compare/v1.2.1...v1.2.2) --- updated-dependencies: - dependency-name: python-dotenv dependency-version: 1.2.2 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-23 12:40:50 +05:30
Olekssy	f654a43ac3	fix: prevent UnboundLocalError when analyzer is not initialized (#301 ) * fix: prevent UnboundLocalError when analyzer is not initialized Move cleanup of analyzer and residuals inside the conditional block where they are actually defined to avoid crashing when --print-residual-geometry or --plot-residuals are not used. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * fix: address AI review feedback on residual cleanup --------- Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-21 08:40:29 +05:30
Magic	ed5d8b9104	feat: add configurable residual processing to reduce peak VRAM usage (#239 ) * refactor residual memory optimizations * formatting * Fixed config.py positioning and default * fixed analyzier declaration in main.py * removing del statements * ruff * small updates * ty moveback ish	2026-04-18 16:46:22 +05:30
dependabot[bot]	5083fc0dd7	build(deps): bump pillow from 12.1.1 to 12.2.0 (#296 ) Bumps [pillow](https://github.com/python-pillow/Pillow) from 12.1.1 to 12.2.0. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/12.1.1...12.2.0) --- updated-dependencies: - dependency-name: pillow dependency-version: 12.2.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-14 19:07:47 +05:30
Darshan	cd422bbb99	fix: make --help return before heavy runtime imports (#293 )	2026-04-12 16:33:30 +05:30
MoonRide303	e2c74bfb3c	fix: support for gemma 4 (#287 )	2026-04-12 12:47:32 +05:30
Vinayyyy7	077e31f663	feat: reproducibility when saving & uploading a heretic model (#191 ) * feat: implement reproducibility features with safetensors * feat: prompt user before creating reproducibility folder * fix: use prompt_confirm wrapper * style comment * style comment * fix: ignore None values in Settings dump for TOML compatibility * fix: imports * feat: auto-generate seed if none provided for full reproducibility * style: fix ruff formatting issues * style: ruff * style: fix ty check errors with ty:ignore * Update src/heretic/main.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update src/heretic/utils.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * add period at end. Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Improve: Add README, checkpoint.jsonl, to Reproduce * fix: use centralize device info, remove random states file * feat: Add CUDA driver version * ruff * ruff... * ty fix * LGTM: Rich native strip, use nvidia-smi * ruff fix * ruff * revert kaggle hack) * normalize names for deduplication of packages/versions * docstring * rufff * cleanup, add suffix for torch CUDA version, distinguish ROCm * add PyTorch index URL detection * revert index URL to be simple * flip priority of index.. * add Important note * add exact suffix for WHL in instruction * add warning for heterogeneous GPU env * extend driver version info (more accelerators) * fix: style * sync * no abbreviation * use multi-line string * fix: prompt_confirm * feat: CPU info * strip 'slow' warning from environment.txt * feat: Add virtual env info to environment.txt * ruffff * feat: AMD (Radeon) GPU driver version * Refactor: system.py * feat: LGTM capturing specifc installation origin of heretic * feat: Include chosen trial into reproduce/README * style: run ruff format on utils.py * feat: reproduce.json * fix: seperate values in different keys * restore comment * style, clean, seperate commit key * no abbreviation, cleanup * remove labels, store only dependencies * missed import, ruff * sort import * feat: More CPU Info * only store direct dependencies of heretic * complete comment * refactor: use cpuinfo package instead * ruff import sort * distinguish cores & threads * move function amd-driver * rename * moving heretic package info, * rufff * Move: cleanup memory cache * fix: model.py import * no unknowns * generalize all accelerator info stuff * ruff f * move package info * type change * feat: no reproducibility suite for local saving/model used * import fix * fix: type check * style change * style ruff * feat: no env.txt, SHA256SUMS file, cleanup * feat: ADD tip to readme * remove trial index, two-keys only * fix: No time-zone * feat: No suite for local datasets allowed * simplify * featt: capture both direct and transitive dependencies * style: sort readme of reproducibility suite * feat: Store commit hash for datasets too * add total refusal prompts for evaluation display * remove try/except from cpu * extend SHA256 support * remove .txt * only have safetensors for SHA256 * style comment * use HF api to get commit hash * fix: requirements containing irrelevant dependencies * only store heretic-llm if from PyPI.. * add SELECTED tag to the trial that was pushed * AttributeError fix * simplify trial preservation * add direction_index in trial info * remove unwanted CPU info * style: rename --------- Co-authored-by: Vinayyyy7 <vinayumrethe99@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-04-11 19:15:19 +05:30
Arthur Wuhrmann	a1a1c30c58	fix: correct default value for max_memory. (#284 ) * fix: correct default value for max_memory. The other does not compile. * fix: update syntax for default value of max_memory	2026-04-08 18:47:41 +05:30
Philipp Emanuel Weidmann	b08a0925c1	feat: make response prefix logic configurable	2026-04-07 13:24:48 +05:30
Philipp Emanuel Weidmann	f612a48b9f	build: prevent installing dependency packages published in the past 7 days	2026-04-04 08:54:37 +05:30
dependabot[bot]	117e3b73ac	build(deps): bump urllib3 from 2.6.1 to 2.6.3 (#273 ) Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.6.1 to 2.6.3. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/2.6.1...2.6.3) --- updated-dependencies: - dependency-name: urllib3 dependency-version: 2.6.3 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:25:54 +05:30
dependabot[bot]	5f6e1e4d52	build(deps): bump requests from 2.32.5 to 2.33.0 (#272 ) Bumps [requests](https://github.com/psf/requests) from 2.32.5 to 2.33.0. - [Release notes](https://github.com/psf/requests/releases) - [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md) - [Commits](https://github.com/psf/requests/compare/v2.32.5...v2.33.0) --- updated-dependencies: - dependency-name: requests dependency-version: 2.33.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:25:25 +05:30
dependabot[bot]	7ebd92dfa7	build(deps): bump pygments from 2.19.2 to 2.20.0 (#271 ) Bumps [pygments](https://github.com/pygments/pygments) from 2.19.2 to 2.20.0. - [Release notes](https://github.com/pygments/pygments/releases) - [Changelog](https://github.com/pygments/pygments/blob/master/CHANGES) - [Commits](https://github.com/pygments/pygments/compare/2.19.2...2.20.0) --- updated-dependencies: - dependency-name: pygments dependency-version: 2.20.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:24:56 +05:30
dependabot[bot]	655d66ef24	build(deps): bump nltk from 3.9.3 to 3.9.4 (#270 ) Bumps [nltk](https://github.com/nltk/nltk) from 3.9.3 to 3.9.4. - [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog) - [Commits](https://github.com/nltk/nltk/compare/3.9.3...3.9.4) --- updated-dependencies: - dependency-name: nltk dependency-version: 3.9.4 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:24:29 +05:30
dependabot[bot]	0f99c882ec	build(deps): bump filelock from 3.20.0 to 3.20.3 (#269 ) Bumps [filelock](https://github.com/tox-dev/py-filelock) from 3.20.0 to 3.20.3. - [Release notes](https://github.com/tox-dev/py-filelock/releases) - [Changelog](https://github.com/tox-dev/filelock/blob/main/docs/changelog.rst) - [Commits](https://github.com/tox-dev/py-filelock/compare/3.20.0...3.20.3) --- updated-dependencies: - dependency-name: filelock dependency-version: 3.20.3 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:23:59 +05:30
dependabot[bot]	92f851b693	build(deps): bump pillow from 12.0.0 to 12.1.1 (#268 ) Bumps [pillow](https://github.com/python-pillow/Pillow) from 12.0.0 to 12.1.1. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/12.0.0...12.1.1) --- updated-dependencies: - dependency-name: pillow dependency-version: 12.1.1 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:23:32 +05:30
dependabot[bot]	81e0c84ec6	build(deps): bump aiohttp from 3.13.2 to 3.13.4 (#267 ) --- updated-dependencies: - dependency-name: aiohttp dependency-version: 3.13.4 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-04 08:10:51 +05:30
Philipp Emanuel Weidmann	887d43a8d9	fix: set batch size on HFLM object	2026-04-01 14:27:43 +05:30
Philipp Emanuel Weidmann	96c7a7d98a	fix: replace tqdm progress bars with Rich progress bars	2026-03-28 18:30:15 +05:30
Philipp Emanuel Weidmann	1126332281	feat: add integrated benchmarking system	2026-03-24 18:25:12 +05:30
Philipp Emanuel Weidmann	19cdf7e244	fix: address ty complaint	2026-03-15 09:58:00 +05:30
Philipp Emanuel Weidmann	94775d4148	chore: update dependencies	2026-03-15 09:31:32 +05:30
cpagac	515a7b9eb5	fix: prevent div-by-zero in evaluator when base_refusals is 0 (#225 ) * fix: prevent div-by-zero in evaluator when base_refusals is 0 When a model refuses all prompts from the start, base_refusals is 0. Return refusals directly in that case so ablations that introduce new refusals are still penalized correctly. * fix: cast refusals to float for type consistency" before hitting commit changes Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-03-13 11:21:23 +05:30
erm14254	e26da5e0e6	fix: display all abliterable components across layers (#215 ) * fix: display all abliterable components across layers The current code only displays abliterable components from layer 0, which is misleading for hybrid architectures like Qwen3.5 that use different attention types across layers (e.g., `linear_attn.out_proj` in some layers, `self_attn.o_proj` in others). This fix iterates through all layers to collect and display the complete set of abliterable components with accurate module counts. Before (Qwen3.5-27B): * attn.out_proj: 1 modules per layer * mlp.down_proj: 1 modules per layer After (Qwen3.5-27B): * attn.out_proj: 48 modules total * attn.o_proj: 16 modules total * mlp.down_proj: 64 modules total * Fix formatting --------- Co-authored-by: Lawfer12 <ac728@ymail.com>	2026-03-11 14:10:37 +05:30
Philipp Emanuel Weidmann	ec0367226d	style: fix formatting and naming	2026-03-06 13:18:08 +05:30
Matthias Stegner	5e3c04c802	feat: add Qwen3.5 MoE hybrid layer support (#187 ) * feat: add Qwen3.5 MoE hybrid layer support Qwen3.5 MoE uses GatedDeltaNet (linear attention) on some layers instead of standard self-attention, causing abliteration to fail because self_attn.o_proj doesn't exist on those layers. Changes: - Wrap self_attn.o_proj in suppress(Exception) and add linear_attn.out_proj as alternative attention out-projection for GatedDeltaNet layers - Scan all layers in get_abliterable_components() instead of only layer 0, since hybrid models have different components on different layers - Derive LoRA target_modules from actual named_modules() instead of splitting component keys, which fails when module names differ across layers (e.g. "o_proj" vs "out_proj") Tested with Qwen3.5-397B-A17B (7/100 refusals, KL 0.2676). Relates to #43 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Apply suggestion from @gemini-code-assist[bot] Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Philipp Emanuel Weidmann <pew@worldwidemann.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-03-06 13:03:57 +05:30
Spiky Moth	303ba9d978	fix: recheck prefix after inserting predefined (#194 )	2026-02-27 08:07:33 +05:30
Philipp Emanuel Weidmann	cb4ef3fdfc	docs: add Trendshift badge to README	2026-02-20 13:00:19 +05:30
cpagac	4c80c4beb9	fix: report VRAM usage across all GPUs instead of only the default device (#169 ) memory_allocated() and memory_reserved() without a device argument only report GPU 0. Sum across all devices for correct multi-GPU totals and add total VRAM reporting.	2026-02-17 12:53:41 +05:30
Spiky Moth	3a115e280c	fix: produce card for local models with existing readme (#157 )	2026-02-15 19:10:10 +05:30

1 2 3 4

162 Commits