A question about the performance

Thanks for your work and the pretrained models!

I'm playing around the pretrained models, however, the quality of the generated samples is not very good. Perhaps I did something wrong.

The only two changes I made are:

[https://github.com/thuhcsi/VoxInstruct/blob/e953295485bd88a293b37735c98b362554f4635b/model/ar.py#L42](https://github.com/thuhcsi/VoxInstruct/blob/e953295485bd88a293b37735c98b362554f4635b/model/ar.py#L42)
`_attn_implementation="flash_attention_2"`
->
`_attn_implementation="eager"`

[https://github.com/thuhcsi/VoxInstruct/blob/e953295485bd88a293b37735c98b362554f4635b/model/nar.py#L42](https://github.com/thuhcsi/VoxInstruct/blob/e953295485bd88a293b37735c98b362554f4635b/model/nar.py#L42)
`_attn_implementation="flash_attention_2"`
->
`_attn_implementation="eager"`

The reason for this change is that I only have V100 GPUs, which do not support flash-attn. The other hyperparameters are followed by `infer.sh`.

Thank you for your help!



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A question about the performance #2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

A question about the performance #2

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions