Outperforming LLMs with less training data and smaller model sizes
blog.research.google — Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes
There is a discussion on Hacker News, but feel free to comment here as well.