Psych-201

Psych-201 is a collaborative data set currently under construction. It extends Psych-101, a large-scale data set containing natural language transcriptions of human psychology experiments. We aim to collect a data set 10x the size of Psych-101, meaning that Psych-201 would cover 100,000,000 human choices from nearly 1,000,000 participants.

Important: If you would like to contribute an experiment, please verify first here that it is not already work in progress! You can also find a list of experiments up for grabs on this link.

Current progress: 16,106,681/100,000,000 human choices

We will summarize this new data set in a paper for the NeurIPS 2025 Datasets and Benchmarks Track (or a similar venue). Every contributor will be eligible for co-authorship on this paper. If you want to contribute, please follow the instructions below. In case you are unsure whether the experiment you have in mind is suitable for Psych-201, please reach out first.

How to contribute

Repository structure

If you want to contribute an experiment, please make a pull request (more on this below). For this, first create a new folder with a meaningful name. This folder should contain:

a README.md file with a paper reference and a link to the original data (if no paper or preprint is available, please add an extended description of the experiment and a statement regarding IRB approval).
a zipped prompts.jsonl file where each line corresponds to one participant. Each line should have the following three fields:
- "text": Natural language transcription of the experiment.
- "experiment": Identifier for the experiment.
- "participant": Identifier for the participant.
The following fields are optional and may be used to include meta-information if it is available (an example of an experiment with such information can be found here):
- "RTs": list of reaction times after stimulus onset (in ms). The length of this list should match the number of choices made by the participant.
- "age": Age of the participant.
- "diagnosis": Clinical diagnosis of the participant.
- "nationality": Nationality of the participant.
- any statistics derived from a questionaire, e.g., "STICSA-T somatic", "STICSA-T cognitive", etc.
a generate_prompts.py file that reads in the original data and produces prompts.jsonl

Please do not upload the original data files to this repository.

This folder provides an example for structuring.

Setup

Fork the repository:

Clone the forked repository:

git clone https://github.com/YOUR-USERNAME/Psych-201

Add your experiments and push:

cd Psych-201/
(add your experiments)
git push

On https://github.com/YOUR-USERNAME/Psych-201 click on "Pull requests", then "New pull request", finally "Create pull request".

Review process

There will be a lightweight review process ensuring that requirements are fulfilled. The communication for this will be done via the corresponding pull request.

We accept experiments that fulfill the following requirements:

Data comes from an experiment with human subjects.
Data is given on a trial-by-trial level (i.e., no aggregated data).
Experiment can be translated into language without a major loss of information.
Temporal ordering should follow the original experiment.

Formatting

Each prompt corresponds to an entire experimental session from one participant.
It should contain data on a trial-by-trial level and start with the instructions.
Use the cover story and instructions from the original paper if possible.
32K tokens per participant is the length limit.
Mark the parts that should be finetuned (i.e., human responses) with “<<“ and “>>”. Do not use these symbols for other parts of the prompt.
For discrete choice options, randomize the names of choice options for each participant binz2022heuristics/generate_prompts.py.

Example prompt:

You will be visiting different planets on which you repeatedly have to predict a winner in an athletic competition between two aliens, labelled A and V.
In each round, you have to indicate the winner by pressing the corresponding key.
Your goal is to be as accurate as possible.
To aid your decision process, you are provided with two attributes of the two aliens.
You receive feedback telling you which alien won after you have made a choice.
You have to make ten predictions per planet.
For each prediction, you encounter a new pair of aliens.
You are beamed to a new planet after making ten predictions, where you have to make predictions about a new and different athletic competition.

Planet 1:
Alien A scores 2.02 higher on attribute 1. Alien A scores 3.36 higher on attribute 2. You press <<A>>. Alien A wins.
Alien A scores 2.21 higher on attribute 1. Alien A scores 0.46 higher on attribute 2. You press <<A>>. Alien A wins.
Alien A scores 1.21 higher on attribute 1. Alien V scores 0.53 higher on attribute 2. You press <<V>>. Alien A wins.
Alien A scores 0.48 higher on attribute 1. Alien V scores 1.14 higher on attribute 2. You press <<A>>. Alien A wins.
Alien A scores 0.93 higher on attribute 1. Alien V scores 2.85 higher on attribute 2. You press <<V>>. Alien A wins.
Alien V scores 1.82 higher on attribute 1. Alien A scores 0.09 higher on attribute 2. You press <<V>>. Alien V wins.
Alien A scores 0.33 higher on attribute 1. Alien A scores 0.54 higher on attribute 2. You press <<A>>. Alien A wins.
Alien V scores 0.39 higher on attribute 1. Alien V scores 0.27 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 0.33 higher on attribute 1. Alien V scores 0.91 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 3.12 higher on attribute 1. Alien V scores 1.08 higher on attribute 2. You press <<V>>. Alien V wins.

Planet 2:
Alien A scores 0.74 higher on attribute 1. Alien A scores 0.28 higher on attribute 2. You press <<A>>. Alien V wins.
Alien V scores 0.04 higher on attribute 1. Alien A scores 0.49 higher on attribute 2. You press <<V>>. Alien A wins.
Alien V scores 1.02 higher on attribute 1. Alien V scores 2.08 higher on attribute 2. You press <<V>>. Alien A wins.
Alien A scores 1.14 higher on attribute 1. Alien A scores 0.99 higher on attribute 2. You press <<V>>. Alien V wins.
Alien A scores 1.18 higher on attribute 1. Alien A scores 1.74 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 0.33 higher on attribute 1. Alien V scores 1.32 higher on attribute 2. You press <<A>>. Alien A wins.
Alien A scores 0.08 higher on attribute 1. Alien A scores 1.15 higher on attribute 2. You press <<V>>. Alien A wins.
Alien V scores 3.02 higher on attribute 1. Alien V scores 3.11 higher on attribute 2. You press <<A>>. Alien A wins.
Alien V scores 1.8 higher on attribute 1. Alien V scores 1.67 higher on attribute 2. You press <<A>>. Alien A wins.
Alien V scores 0.06 higher on attribute 1. Alien A scores 0.69 higher on attribute 2. You press <<A>>. Alien A wins.

...

Planet 30:
Alien V scores 0.24 higher on attribute 1. Alien A scores 1.1 higher on attribute 2. You press <<A>>. Alien V wins.
Alien V scores 0.8 higher on attribute 1. Alien V scores 1.12 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 1.22 higher on attribute 1. Alien A scores 0.47 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 0.25 higher on attribute 1. Alien V scores 0.74 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 1.61 higher on attribute 1. Alien V scores 0.39 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 1.72 higher on attribute 1. Alien V scores 0.03 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 2.06 higher on attribute 1. Alien V scores 1.09 higher on attribute 2. You press <<V>>. Alien V wins.
Alien A scores 0.24 higher on attribute 1. Alien V scores 0.48 higher on attribute 2. You press <<V>>. Alien A wins.
Alien V scores 0.7 higher on attribute 1. Alien V scores 1.43 higher on attribute 2. You press <<V>>. Alien V wins.
Alien V scores 0.7 higher on attribute 1. Alien A scores 0.39 higher on attribute 2. You press <<A>>. Alien A wins.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
awad2018moral		awad2018moral
bavard2023functional		bavard2023functional
binz2022heuristics		binz2022heuristics
demircan2024evaluatingcategory		demircan2024evaluatingcategory
demircan2024evaluatingreward		demircan2024evaluatingreward
dubois2022value		dubois2022value
fan2022trait		fan2022trait
feher2020humans		feher2020humans
guenther2020LDT		guenther2020LDT
guenther2020TS		guenther2020TS
guenther2022relational		guenther2022relational
guenther2023ViSpa		guenther2023ViSpa
guenther2023grammaticality		guenther2023grammaticality
guenther2024comprehension		guenther2024comprehension
jagadish2023zero		jagadish2023zero
jansen2021logic		jansen2021logic
marshall_2022_brightness		marshall_2022_brightness
pirrone_2018_dots		pirrone_2018_dots
pirrone_unpublished_food		pirrone_unpublished_food
pirrone_unpublished_lottery		pirrone_unpublished_lottery
russek2024heuristics		russek2024heuristics
screenshots		screenshots
singh2022representing		singh2022representing
witte_thalmann2024exploration		witte_thalmann2024exploration
xu2021novelty		xu2021novelty
.gitattributes		.gitattributes
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
stats.py		stats.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Psych-201

How to contribute

Repository structure

Setup

Review process

Formatting

About

Releases

Packages

Languages

schiekiera/Psych-201

Folders and files

Latest commit

History

Repository files navigation

Psych-201

How to contribute

Repository structure

Setup

Review process

Formatting

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages