Skip to content
View davisrbr's full-sized avatar

Highlights

  • Pro

Block or report davisrbr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NanoGPT (124M) in 3 minutes

Python 2,081 208 Updated Jan 18, 2025

[NeurIPS'24 Spotlight] Observational Scaling Laws

Jupyter Notebook 49 3 Updated Oct 2, 2024
Jupyter Notebook 2 2 Updated Jan 3, 2025

A Lossless Compression Library for AI pipelines

Python 216 26 Updated Jan 15, 2025

Inspect: A framework for large language model evaluations

Python 723 160 Updated Jan 17, 2025

An implementation of the Llama architecture, to instruct and delight

Python 21 Updated Jan 16, 2025
Python 99 10 Updated Dec 28, 2024

A fast + lightweight implementation of the GCG algorithm in PyTorch

Python 157 37 Updated Jan 7, 2025

AI Logging for Interpretability and Explainability🔬

Python 98 7 Updated Jun 7, 2024

Decomposing and Editing Predictions by Modeling Model Computation

Jupyter Notebook 131 8 Updated Jun 12, 2024

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 459 41 Updated Jan 15, 2025

Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature

Python 120 12 Updated Jul 31, 2024

ML Benchmarks in Algebraic Combinatorics

Jupyter Notebook 2 1 Updated Jan 17, 2025

Universal Neurons in GPT2 Language Models

Jupyter Notebook 27 6 Updated May 28, 2024

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 950 70 Updated Nov 4, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 403 40 Updated Oct 20, 2024

Training Sparse Autoencoders on Language Models

Jupyter Notebook 576 136 Updated Jan 17, 2025

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Python 679 71 Updated Jan 3, 2025

Public reposetory for code and results of parts of "Steering Llama 2 via Contrastive Activation Addition" by Rimsky, Gabrieli, Schulz et al.

Python 9 2 Updated Dec 22, 2023
Jupyter Notebook 82 10 Updated Jan 30, 2024

llmstep: [L]LM proofstep suggestions in Lean 4.

Python 120 15 Updated Nov 11, 2023
Python 28 3 Updated Sep 17, 2024

NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers

Python 39 1 Updated Aug 3, 2024

[NeurIPS 2023] Learning Transformer Programs

Python 157 23 Updated May 21, 2024

Sparse probing paper full code.

Jupyter Notebook 53 10 Updated Dec 17, 2023

A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations

Jupyter Notebook 14 11 Updated Apr 15, 2024
Python 513 44 Updated Feb 5, 2024

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,987 432 Updated Jan 5, 2025
Next
Showing results