Make model type backwards compatible #1212

kunal-vaishnavi · 2025-01-31T18:18:26Z

Description

This PR makes accessing the model type possible by reading from the GenAI config if the model object does not contain the type attribute. It also adds the chat and system templates for Qwen models.

Motivation and Context

This PR allows the examples to be backwards compatible with the published RCs for v0.6.0.

The Qwen chat template and Qwen system template were obtained from the following information.

>>> from transformers import AutoTokenizer
>>> tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct", cache_dir="./cache_dir")
tokenizer_config.json: 100%|█████████████████████████████████████████████████████████████████████████████| 7.30k/7.30k [00:00<00:00, 9.64MB/s]
vocab.json: 100%|████████████████████████████████████████████████████████████████████████████████████████| 2.78M/2.78M [00:00<00:00, 16.5MB/s]
merges.txt: 100%|████████████████████████████████████████████████████████████████████████████████████████| 1.67M/1.67M [00:00<00:00, 15.6MB/s]
tokenizer.json: 100%|████████████████████████████████████████████████████████████████████████████████████| 7.03M/7.03M [00:00<00:00, 21.5MB/s]
>>> prompt = "Give me a short introduction to large language model."
>>> messages = [ {"role": "system", "content": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant."}, {"role": "user", "content": prompt} ]
>>> text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
>>> text
'<|im_start|>system\nYou are Qwen, created by Alibaba Cloud. You are a helpful assistant.<|im_end|>\n<|im_start|>user\nGive me a short introduction to large language model.<|im_end|>\n<|im_start|>assistant\n'

examples/python/model-chat.py

MaanavD

LGTM. Tested and works!

apsonawane · 2025-01-31T22:25:09Z

In model-chat.py in line 105 should we not have this line to modify prompt based on chat_template?
prompt = f'{chat_template.format(input=text)}'

### Description This PR replaces the dummy Hugging Face authentication token with an actual read-only token. ### Motivation and Context This fixes the CI failures that are happening. This change is separated from [this PR](#1212) for clarity in the repo's commit history.

### Description This PR makes accessing the model type possible by reading from the GenAI config if the model object does not contain the type attribute. It also adds the chat and system templates for Qwen models. ### Motivation and Context This PR allows the examples to be backwards compatible with the published RCs for v0.6.0. The Qwen chat template and Qwen system template were obtained from the following information. ``` >>> from transformers import AutoTokenizer >>> tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct", cache_dir="./cache_dir") tokenizer_config.json: 100%|█████████████████████████████████████████████████████████████████████████████| 7.30k/7.30k [00:00<00:00, 9.64MB/s] vocab.json: 100%|████████████████████████████████████████████████████████████████████████████████████████| 2.78M/2.78M [00:00<00:00, 16.5MB/s] merges.txt: 100%|████████████████████████████████████████████████████████████████████████████████████████| 1.67M/1.67M [00:00<00:00, 15.6MB/s] tokenizer.json: 100%|████████████████████████████████████████████████████████████████████████████████████| 7.03M/7.03M [00:00<00:00, 21.5MB/s] >>> prompt = "Give me a short introduction to large language model." >>> messages = [ {"role": "system", "content": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant."}, {"role": "user", "content": prompt} ] >>> text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True) >>> text '<|im_start|>system\nYou are Qwen, created by Alibaba Cloud. You are a helpful assistant.<|im_end|>\n<|im_start|>user\nGive me a short introduction to large language model.<|im_end|>\n<|im_start|>assistant\n' ```

kunal-vaishnavi added 5 commits January 31, 2025 17:57

Make model type backwards compatible

aef4c18

Add missing import

5e17c66

Open file before loading JSON

bb7591e

Add Qwen chat and system templates

e8c48b0

Allow user-provided templates to override defaults

6f2acc6

baijumeswani reviewed Jan 31, 2025

View reviewed changes

examples/python/model-chat.py Outdated Show resolved Hide resolved

kunal-vaishnavi added the 0.6.0 label Jan 31, 2025

baijumeswani previously approved these changes Jan 31, 2025

View reviewed changes

Make templates more flexible to user input

2bfcf68

kunal-vaishnavi dismissed baijumeswani’s stale review via 2bfcf68 January 31, 2025 19:45

apsonawane reviewed Jan 31, 2025

View reviewed changes

examples/python/model-chat.py Outdated Show resolved Hide resolved

Fix naming bug

bedea86

MaanavD previously approved these changes Jan 31, 2025

View reviewed changes

Fix case where chat template is not provided

ca2b705

kunal-vaishnavi dismissed MaanavD’s stale review via ca2b705 January 31, 2025 20:57

kunal-vaishnavi added 3 commits January 31, 2025 20:58

Fix typo in system prompt condition

d74a458

Try lowering transformers version for CIs

057f82b

Try lowering Hugging Face hub version for CIs

e880ca9

Add actual HF token

53f1432

baijumeswani previously approved these changes Jan 31, 2025

View reviewed changes

Always format chat template

e3ed1bc

kunal-vaishnavi dismissed baijumeswani’s stale review via e3ed1bc January 31, 2025 22:47

baijumeswani approved these changes Jan 31, 2025

View reviewed changes

apsonawane approved these changes Jan 31, 2025

View reviewed changes

kunal-vaishnavi mentioned this pull request Jan 31, 2025

Add actual Hugging Face token to CIs #1214

Merged

kunal-vaishnavi merged commit dba39b4 into main Feb 1, 2025
14 checks passed

kunal-vaishnavi deleted the kvaishnavi/model-type-in-examples branch February 1, 2025 00:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make model type backwards compatible #1212

Make model type backwards compatible #1212

kunal-vaishnavi commented Jan 31, 2025 •

edited

Loading

MaanavD left a comment

apsonawane commented Jan 31, 2025

Make model type backwards compatible #1212

Make model type backwards compatible #1212

Conversation

kunal-vaishnavi commented Jan 31, 2025 • edited Loading

Description

Motivation and Context

MaanavD left a comment

Choose a reason for hiding this comment

apsonawane commented Jan 31, 2025

kunal-vaishnavi commented Jan 31, 2025 •

edited

Loading