Constitutional AI

This repo is an attempt to reproduce the results of Anthropic's paper on Constitutional AI. The paper can be found here. In particular, I am using the Hugging Face method described here.

In short, I will attempt the following:

Create a dataset using Mistral-7B-Instruct-v0.1 from some of Anthropic's red-teaming prompts
Fine-tune the model on this dataset
Evaluate the model on its ability to generate text that is aligned with the constitution
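
To make the dataset step concrete, here is a minimal sketch of the critique-and-revision loop that Constitutional AI uses to turn a red-teaming prompt into training data. Everything named here (`Generate`, `Principle`, `buildCaiExample`, `toTrainingRecords`, and the prompt layout) is a hypothetical illustration and not code from this repo; the `generate` function stands in for whatever call reaches Mistral-7B-Instruct-v0.1. Using the revised response as the preferred answer and the initial response as the rejected one is my understanding of the Hugging Face recipe, so treat that pairing as an assumption.

```typescript
// Hypothetical sketch: names and prompt layout are assumptions, not this repo's code.

// Stand-in for a call to the model (e.g. an HTTP request to an inference server).
type Generate = (prompt: string) => Promise<string>;

// One constitutional principle: how to ask for a critique, then for a revision.
interface Principle {
  critiqueRequest: string;
  revisionRequest: string;
}

interface CaiExample {
  prompt: string;
  initialResponse: string;
  critique: string;
  revisedResponse: string;
}

// Run one critique-and-revision pass for a single red-teaming prompt.
async function buildCaiExample(
  prompt: string,
  principle: Principle,
  generate: Generate,
): Promise<CaiExample> {
  // 1. Ask the model to answer the raw prompt.
  const initialResponse = await generate(prompt);
  // 2. Ask the model to critique its own answer against the principle.
  const critique = await generate(
    `${prompt}\n\n${initialResponse}\n\n${principle.critiqueRequest}`,
  );
  // 3. Ask the model to rewrite the answer in light of the critique.
  const revisedResponse = await generate(
    `${prompt}\n\n${initialResponse}\n\n${critique}\n\n${principle.revisionRequest}`,
  );
  return { prompt, initialResponse, critique, revisedResponse };
}

// Turn one example into a supervised record and a preference record.
// Assumption: revised response is "chosen", initial response is "rejected".
function toTrainingRecords(ex: CaiExample) {
  return {
    sft: { prompt: ex.prompt, completion: ex.revisedResponse },
    preference: {
      prompt: ex.prompt,
      chosen: ex.revisedResponse,
      rejected: ex.initialResponse,
    },
  };
}
```

Passing `generate` in as a parameter keeps the loop independent of how the model is served, which also makes it easy to exercise with a stub before spending any GPU time.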

I'm going to attempt to do as much as possible in TypeScript, as I think it is a wholly superior language to Python. 😜


vanbujm/con-ai
