Stars
[NeurIPS'24 Spotlight] Observational Scaling Laws
Inspect: A framework for large language model evaluations
An implementation of the Llama architecture, to instruct and delight
A fast + lightweight implementation of the GCG algorithm in PyTorch
AI Logging for Interpretability and Explainability🔬
Decomposing and Editing Predictions by Modeling Model Computation
The nnsight package enables interpreting and manipulating the internals of deep learning models.
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
ML Benchmarks in Algebraic Combinatorics
Universal Neurons in GPT2 Language Models
A benchmark to evaluate language models on questions I've previously asked them to solve.
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Training Sparse Autoencoders on Language Models
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
Public repository for code and results from parts of "Steering Llama 2 via Contrastive Activation Addition" by Rimsky, Gabrieli, Schulz et al.
llmstep: [L]LM proofstep suggestions in Lean 4.
NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers
[NeurIPS 2023] Learning Transformer Programs
Full code for the sparse probing paper.
Mech-Interp / PySvelte
Forked from anthropics/PySvelte. A library for bridging Python and HTML/JavaScript (via Svelte) for creating interactive visualizations.
A concise but complete full-attention transformer with a set of promising experimental features from various papers