Skip Navigation

Hacker News @lemmy.smeargle.fans bot @lemmy.smeargle.fans

10mo ago

Outperforming larger language models with less training data and smaller models

blog.research.google Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

LocalLLaMA @sh.itjust.works Zetaphor @zemmy.cc 10mo ago

Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

blog.research.google /2023/09/distilling-step-by-step-outperforming.html

Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

28 3

Generative AI @mander.xyz fossilesque @mander.xyz 10mo ago

Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

blog.research.google /2023/09/distilling-step-by-step-outperforming.html

Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

1 0

Hacker News @derp.foo haxor @derp.foo

10mo ago

Outperforming LLMs with less training data and smaller model sizes

blog.research.google /2023/09/distilling-step-by-step-outperforming.html

Outperforming LLMs with less training data and smaller model sizes

0 0

0 comments