Have the basics of ollama #4

DanielMarchand · 2024-07-14T20:49:32Z

The basics work. The problem is the code base is not very well-designed to handled custom prompting depending on the model. For example wake up dates require longer token limits with the llama3 models than with opean ai ones. Also I had to switch from system to assistant in the chat complete to get better answers, there are other subtle differences in how the prompts need to be set up, would be nice to discuss an overall architecture for this. Otherwise I think this is a really cool direction letting people with decent GPUs (tested on 3080, i'm sure 4090 would be even more special) get nice results at no cost.

This is heavily based on joonspk-research#155 by ketsapiwiq I had do some aspects differently but much of the logic is the same

… for different backends

… period of time

… tasks

… is becoming relatively stable

…p schedule prompts

… behaviour)

…rying to figure out Django view exceptions

…l llm

…o a json for later model checking and evaluation

…raining the use of triples

…the only one I know that 'should' work

Support vLLM on EC2 instances

DanielMarchand added 30 commits July 14, 2024 22:42

added ollama requirements to pip

fbb12c5

added basic ollama support

34302ca

more work on getting prompts to align for llama3

3b1b890

added more optional printing

e171aa0

more modifications to the prompts to make more friendly to local model

aabf19a

begun restructure of how prompts are stored and selected

a8bbee2

renamed openai_config to llm_config to reflect the increasing support…

2ddac3a

… for different backends

removed defunct run gpt prompt

a8fc578

moved the EXCEPT_ON_FAILSAFE to utils.py and updated docs

d7dd9b9

changed prompts to use the new prompt dirs

9047ed5

udpated hourly schedule prompt for better results

711e2e6

added note about the embeddings not working on n25

cd6c080

prompt changes for llama3 and cleaning up the task decomp function

c307729

updates to sections that generate (s,p,o) triples)

f87c65d

substantial improvements in local run time

d2201b9

local model very close to being able to run independantly for lengthy…

361607c

… period of time

made task decomp regex more robust, set min temperature to 0.1 in all…

5f76763

… tasks

fixing issues with generating focal point

ed07874

refactoring event triple, object event and insight and evidence, code…

d276734

… is becoming relatively stable

fixed instabilities in room locator, gave ChatGPT_safe more retries

bb20209

removed misleading comments (probably copy/pasted from another function)

559ceea

substantial work to generating triples, as well as location and decom…

8b5e12f

…p schedule prompts

restored to only using the debug variable in utils

983b34b

added EXCEPT_ON_FAILSAFE=False default to utils.py (preserves default…

a1dbd26

… behaviour)

removed symbol in emojii (may cause problems on windows though?)

5a61885

removed some dead comments and print statements

0fede96

except on too many retries set to False

5181a3a

added simulation name validator so nobody else has to waste an hour t…

4d6e5d1

…rying to figure out Django view exceptions

Merge commit '4d6e5d18' into fix-and-improve_add-ollama

644005f

fixed issue in merge

8dee870

DanielMarchand added 9 commits July 26, 2024 10:45

improved parsing of some outputs, code is now largely stable for loca…

c1ed7fc

…l llm

made compress sim use a command line argument

b08ea5c

made the safe_generate command easier to parse the complete trace int…

1da280d

…o a json for later model checking and evaluation

switching to mistralnemo llm

10bc2c0

finally replaced all ChatGPT_safe_response with safe_response, fine-g…

561c9c7

…raining the use of triples

fixed bug in logging that would mess up the retry count

8907061

poignancy more robust to errors, failsafe set to 1

c5d6351

moved nemo prompts to the other repos, due to parser changes this is …

9d49a96

…the only one I know that 'should' work

place the extracted json files into separate directories

004bf75

chowington referenced this pull request in crcresearch/agentic_collab Sep 30, 2024

Merge pull request #4 from batterylake/aws

844312b

Support vLLM on EC2 instances

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Have the basics of ollama #4

Have the basics of ollama #4

DanielMarchand commented Jul 14, 2024 •

edited

Loading

Have the basics of ollama #4

Are you sure you want to change the base?

Have the basics of ollama #4

Conversation

DanielMarchand commented Jul 14, 2024 • edited Loading

DanielMarchand commented Jul 14, 2024 •

edited

Loading