Debugging and removing redundant lines

by unuu - opened 19 days ago

base: refs/heads/main

←

from: refs/pr/8

Discussion Files changed

-8

unuu

19 days ago

I think these changes are needed and fixes a bug but I don't know what testing process was done.
Please see this issue and if deemed valid, I will move the explanations here.

Debugging and removing redundant lines4a2e2346

SFLY5

BAIDU org 19 days ago

•

edited 19 days ago

Thanks for your advice! You can pass 'use_cache=False' or downgrade the transformers library to version 4.53.0 to avoid this bug. In our model, we use our own cache but not hf's cache.

generated_ids = model.generate(
    inputs=inputs['input_ids'].to(device),
    **inputs,
    max_new_tokens=128,
    use_cache=False
    )

unuu

14 days ago

@SFLY5 thank you for your response. However, using 'use_cache=False' would make inference very slow (I also confirmed my proposed changes doesn't solve the issue). I just tested downgrading transformers version but it does not seem to truly enable the caching logic (please point out if that is unexpected behavior).

AntonV

6 days ago

Hey @unuu ,
we are working on integrating ernie vl to transformers natively over here

For now, like pointed out - a lower version (not sure which one exactly) is required or you modify your installed files like here (disabling the transformers cache creation which is currently incompatible)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment