Skip to content

Commit

Permalink
vault backup: 2024-03-05 - 1 files
Browse files Browse the repository at this point in the history
Affected files:
Monthly Notes/Mar 2024 notes.md
  • Loading branch information
swyx committed Mar 5, 2024
1 parent 0ef8c4b commit 7295aa1
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions Monthly Notes/Mar 2024 notes.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@
- Lower refusal rate - very good to combat anthropic safetyist image and topical vs gemini issues from feb
- "Opus not only found the needle, it recognized that the inserted needle was so out of place in the haystack that this had to be an artificial test constructed by us to test its attention abilities." [from Anthropic prompt engineer](https://twitter.com/alexalbert__/status/1764722513014329620)
- criticized by [MMitchell](https://x.com/mmitchell_ai/status/1764739357112713267?s=20) and [Connor Leahy](https://x.com/NPCollapse/status/1764740710731837516?s=20) and [Delip Rao](https://x.com/deliprao/status/1764675843542995026?s=20)
- could be overrated - [GPT3 also does it because the needle is so out of context](https://x.com/zggyplaydguitar/status/1764791981782262103?s=46&t=90xQ8sGy63D2OtiaoGJuww)
- could be overrated - [GPT3 also does it because the needle is so out of context](https://x.com/zggyplaydguitar/status/1764791981782262103?s=46&t=90xQ8sGy63D2OtiaoGJuww). [Jim fan](https://twitter.com/DrJimFan/status/1765076396404363435) and [yannic kilcher agree]([https://youtu.be/GBOE9fVVVSM?si=IBMCYkmSiVg-MrFr](https://youtu.be/GBOE9fVVVSM?si=IBMCYkmSiVg-MrFr "https://youtu.be/GBOE9fVVVSM?si=IBMCYkmSiVg-MrFr"))
- Evals
- [choosing to highlight Finance, Medicine, Philosophy domain evals rather than MMLU/HumanEval is good](https://twitter.com/DrJimFan/status/1764719012678897738)
- [59.5% on GPQA](https://x.com/idavidrein/status/1764675668175094169?s=20) is much better than generalist PhDs and GPT4 - GPQA author is impressed. [paper]([arxiv.org/abs/2311.12022](https://t.co/hb4u4xXzkw)).
Expand Down Expand Up @@ -55,4 +55,5 @@

## reading and learning

- both [thebloke](https://twitter.com/osanseviero/status/1765103307352055991) and teknium got their start 1 yr ago
- both [thebloke](https://twitter.com/osanseviero/status/1765103307352055991) and teknium got their start 1 yr ago
- [Training great LLMs from ground zero](https://www.yitay.net/blog/training-great-llms-entirely-from-ground-zero-in-the-wilderness) - Yi Tay of Reka

0 comments on commit 7295aa1

Please sign in to comment.