
How to pass in an attention_mask that is one dimension more than input_ids #2301

Open

Chinesehou97 opened this issue Dec 31, 2024 · 1 comment

@Chinesehou97

System Info

Hello, how can I pass in an attention_mask that has one more dimension than input_ids? For example:

output = peft_model.generate(input_ids, attention_mask=attention_mask, max_new_tokens=100)

Here input_ids has shape [batch_size, N] and attention_mask has shape [batch_size, N, N]. Under this condition, running the line above raises the following error:

File "/root/anaconda3/lib/python3.10/site-packages/transformers/modeling_attn_mask_utils.py", line 179, in _expand_mask
    bsz, src_len = mask.size()
ValueError: too many values to unpack (expected 2)
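
The unpacking failure is easy to see in isolation; a minimal sketch with dummy tensors of the shapes from the report (no model involved):

```python
import torch

batch_size, n = 2, 16

mask_2d = torch.ones(batch_size, n)      # [batch_size, N] padding mask that generate() expects
bsz, src_len = mask_2d.size()            # unpacks fine

mask_3d = torch.ones(batch_size, n, n)   # [batch_size, N, N] mask from the report
bsz, src_len = mask_3d.size()            # ValueError: too many values to unpack (expected 2)
```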

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder
  • My own task or dataset (give details below)

Reproduction

```python
input_ids = torch.cat([
    (torch.ones(input_ids.shape[0], 1) * uni_prompting.sptids_dict['<|mmu|>']).to(device),
    (torch.ones(input_ids.shape[0], 1) * uni_prompting.sptids_dict['<|soi|>']).to(device),
    image_tokens,
    (torch.ones(input_ids.shape[0], 1) * uni_prompting.sptids_dict['<|eoi|>']).to(device),
    (torch.ones(input_ids.shape[0], 1) * uni_prompting.sptids_dict['<|sot|>']).to(device),
    input_ids
], dim=1).long()

attention_mask = create_attention_mask_for_mmu(input_ids.to(device),
                                               eoi_id=int(uni_prompting.sptids_dict['<|eoi|>']))
cont_toks_list = peft_model.generate(input_ids, attention_mask=attention_mask, max_new_tokens=100)
```
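
For context, a minimal self-contained sketch of the same call pattern; the tiny checkpoint, the default LoRA config, and the ad-hoc all-ones 3-D mask below are stand-ins for the actual multimodal setup, and whether it reproduces the exact traceback depends on the transformers version and model architecture:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, TaskType, get_peft_model

# Stand-in checkpoint: any small causal LM is enough for the shape experiment.
model_name = "hf-internal-testing/tiny-random-LlamaForCausalLM"
tokenizer = AutoTokenizer.from_pretrained(model_name)
base_model = AutoModelForCausalLM.from_pretrained(model_name)

# Untrained LoRA adapter, only so that generate() goes through a PeftModel.
peft_model = get_peft_model(base_model, LoraConfig(task_type=TaskType.CAUSAL_LM))

input_ids = tokenizer("a short prompt", return_tensors="pt").input_ids  # [batch_size, N]
n = input_ids.shape[1]

# 2-D mask: the shape generate() expects; this call succeeds.
attention_mask_2d = torch.ones_like(input_ids)
peft_model.generate(input_ids, attention_mask=attention_mask_2d, max_new_tokens=5)

# 3-D [batch_size, N, N] mask as in the report; this raises a shape error
# (the exact message varies with the transformers version).
attention_mask_3d = torch.ones(input_ids.shape[0], n, n, dtype=torch.long)
peft_model.generate(input_ids, attention_mask=attention_mask_3d, max_new_tokens=5)
```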

Expected behavior

Load the model for fine-tuning and run inference (generation).

@BenjaminBossan
Member

Would it be possible for you to provide complete code to reproduce the error? The given snippet is not enough. The model doesn't need to be trained; just ensure you configure the same PEFT method as in your initial problem.
