You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In addition, will there be any issues with the loss calculation method in the code for SFT data where the labels contain values of -100 (the prompt and padding parts)?
I noticed that the code does not support the passing of attention_mask, making it impossible to perform padding operations for SFT data?
EasyContext/easy_context/zigzag_ring_attn/monkey_patch.py
Line 26 in fe49492
In addition, will there be any issues with the loss calculation method in the code for SFT data where the labels contain values of -100 (the prompt and padding parts)?
EasyContext/train.py
Lines 117 to 138 in fe49492
Look forward to your response. Thank you.
The text was updated successfully, but these errors were encountered: