Data Preparation Livestream

Trelis Research

May 29

and upcoming vids

Read →

6 Comments

Damom

May 29Liked by Trelis Research

+ https://arxiv.org/abs/2404.16811 = perfect FT model?? Reg - Damon

Expand full comment

Reply (1)

Trelis Research

May 30Author

Nice yeah it makes intuitive sense that a custom dataset requiring answers from questions/data in the middle of text can improve the lost-in-the-middle problem.

Expand full comment

Damom

May 29Liked by Trelis Research

Hi Roman, look at https://arxiv.org/abs/2405.16684 new logic concept: scale law/parameters/gzip

Expand full comment

Reply (1)

Trelis Research

May 30Author

Interesting, so compressibility gives some sense of the data quality. Seems that if your data is more compressible, you need more of it (or at least should weigh towards more data than increasing model size).

Expand full comment

codeforfun

Jul 27

is there a recording for this session? Thanks

Expand full comment

Reply (1)

Trelis Research

Jul 28Author

here you go! https://youtube.com/live/lQh0WLxdYos

Expand full comment

Trelis Research Updates

Data Preparation Livestream