Skip to content

Question about ultrafineweb dataset preparation for tequila/sherry #233

@koshostanford

Description

@koshostanford

Hello,

I'm trying to reproduce the results of tequila/sherry. But I'm not seeing the accuracy reported by the paper (I attached the tequila results here, trained by me). I suspect it's the way I prepare the dataset is not the same as the authors did. Can you share how we can prepare the ultrafineweb dataset for the tequila/sherry QAT.

Image

Thanks very much
Sho

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions