Things started failing after new commit into main #661

gane5hvarma · 2024-10-30T06:53:04Z

System Info

amazon linux 2
Running it in l40s

Information

Docker
The CLI directly

Tasks

An officially supported command
My own modifications

Reproduction

model=meta-llama/Meta-Llama-3.1-8B-Instruct
volume=$PWD/data
docker run -d --gpus all --env-file .env --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/predibase/lorax:main --model-id $model
See this error after model gets downloaded

'FlashLlamaAttention' object has no attribute 'fp8_kv'

Expected behavior

webserver should spin. It worked with older version. There seems to be problem after the new commit went into main

The text was updated successfully, but these errors were encountered:

arnavgarg1 · 2024-10-30T17:36:35Z

cc: @ajtejankar

ajtejankar · 2024-10-30T18:06:02Z

Should be fixed with #662

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Things started failing after new commit into main #661

Things started failing after new commit into main #661

gane5hvarma commented Oct 30, 2024 •

edited

Loading

arnavgarg1 commented Oct 30, 2024

ajtejankar commented Oct 30, 2024

Things started failing after new commit into main #661

Things started failing after new commit into main #661

Comments

gane5hvarma commented Oct 30, 2024 • edited Loading

System Info

Information

Tasks

Reproduction

Expected behavior

arnavgarg1 commented Oct 30, 2024

ajtejankar commented Oct 30, 2024

gane5hvarma commented Oct 30, 2024 •

edited

Loading