Learn Machine Learning

Google open sources tools to support AI model development

techcrunch.com

Google is launching JetStream, a new engine to run generative AI models, and MaxDiffusion, a collection of reference implementations of various diffusion models.

ylai @lemmy.ml 8mo ago

pytorch.org Finetune LLMs on your own consumer hardware using tools from PyTorch and Hugging Face ecosystem

We demonstrate how to finetune a 7B parameter model on a typical consumer GPU (NVIDIA T4 16GB) with LoRA and tools from the PyTorch and Hugging Face ecosystem, with a complete, reproducible Google Colab notebook.
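For a rough sense of why LoRA fits on a small GPU: it freezes a pretrained weight matrix and trains only a low-rank update made of two thin matrices. The sketch below just counts trainable parameters for one layer. The hidden size of 4096 and rank of 8 are illustrative assumptions, not values taken from the linked post.

```python
def full_finetune_params(d_in: int, d_out: int) -> int:
    """Trainable parameters when the whole d_out x d_in weight matrix is updated."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters for a LoRA update B @ A,
    where B is d_out x rank and A is rank x d_in."""
    return rank * (d_in + d_out)


# Illustrative values only (assumed, not from the linked post).
hidden = 4096
rank = 8

full = full_finetune_params(hidden, hidden)
lora = lora_params(hidden, hidden, rank)
print(f"full: {full}, LoRA: {lora}, reduction: {full // lora}x")
```

Under these assumed sizes the adapter trains a small fraction of the layer's weights, and optimizer state shrinks along with it.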

ylai @lemmy.ml 9mo ago

pytorch.org Understanding GPU Memory 2: Finding and Removing Reference Cycles

This is part 2 of the Understanding GPU Memory blog series. Our first post, "Understanding GPU Memory 1: Visualizing All Allocations over Time", shows how to use the Memory Snapshot tool. In this part, we use the Memory Snapshot to visualize a GPU memory leak caused by reference cycles, and then l...
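To see what a reference cycle does to object lifetime, here is a minimal CPU-only sketch using just the standard library (no PyTorch, no GPU). The `Buffer` class is a hypothetical stand-in for something that owns a large allocation. The behavior shown assumes CPython, where reference counting alone cannot free objects that point at each other.

```python
import gc
import weakref


class Buffer:
    """Hypothetical stand-in for an object holding a large allocation."""

    def __init__(self, name):
        self.name = name
        self.other = None


def demo_cycle():
    """Return (alive_before_gc, alive_after_gc) for an object caught in a cycle."""
    was_enabled = gc.isenabled()
    gc.disable()  # keep the cycle collector from running on its own
    try:
        a = Buffer("a")
        b = Buffer("b")
        a.other = b  # a -> b
        b.other = a  # b -> a closes the cycle

        probe = weakref.ref(a)  # observe a without keeping it alive
        del a, b  # drop our names; the cycle still holds both objects

        alive_before = probe() is not None
        gc.collect()  # the cycle collector finds and frees the pair
        alive_after = probe() is not None
    finally:
        if was_enabled:
            gc.enable()
    return alive_before, alive_after


before, after = demo_cycle()
print("alive before gc.collect():", before)
print("alive after gc.collect():", after)
```

If such an object held GPU memory, that memory would stay allocated until the cycle collector ran, which is the kind of delay the post sets out to visualize.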

ylai @lemmy.ml 11mo ago

PyTorch: Compiling NumPy code into C++ or CUDA via torch.compile

pytorch.org PyTorch

An open source machine learning framework that accelerates the path from research prototyping to production deployment.

ShadowAether @sh.itjust.works 12mo ago

Introduction to Kernel Methods for Machine Learning

seis.bristol.ac.uk /~enicgc/pubs/2000/svmintro.pdf

Kernel methods give a systematic and principled approach to training learning machines, and the good generalization performance achieved can be readily justified using statistical learning theory or Bayesian arguments. We describe how to use kernel methods for classification, regression and novelty detection, and in each case we find that training can be reduced to optimization of a convex cost function.
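As one standard instance of the "convex cost function" claim, consider the soft-margin support vector machine for binary classification in its dual form. The notation here is assumed for illustration and may differ from the linked paper: training points $x_i$ with labels $y_i \in \{-1, +1\}$, kernel $K$, multipliers $\alpha_i$, and a regularization constant $C$.

```latex
\max_{\alpha}\; \sum_{i=1}^{m} \alpha_i
  \;-\; \frac{1}{2} \sum_{i=1}^{m} \sum_{j=1}^{m}
        \alpha_i \alpha_j \, y_i y_j \, K(x_i, x_j)
\qquad \text{subject to} \qquad
0 \le \alpha_i \le C, \quad \sum_{i=1}^{m} \alpha_i y_i = 0 .
```

When $K$ is a valid (positive semidefinite) kernel, the objective is a concave quadratic in $\alpha$ and the constraints are linear, so this is a convex problem with no spurious local optima. The resulting classifier is $f(x) = \operatorname{sign}\big(\sum_i \alpha_i y_i K(x_i, x) + b\big)$.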

ShadowAether @sh.itjust.works 12mo ago

The Kernel Cookbook: Advice on Covariance functions

www.cs.toronto.edu /~duvenaud/cookbook/

If you've ever asked yourself: "How do I choose the covariance function for a Gaussian process?" this is the page for you. Here you'll find concrete advice on how to choose a covariance function for your problem, or better yet, make your own.
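A small sketch of what "make your own" can look like: sums and products of valid covariance functions are again valid covariance functions, so kernels can be composed from simple parts. The helper names (`squared_exponential`, `periodic`, `add`, `multiply`) and all hyperparameter values are assumptions for illustration, not code from the linked page.

```python
import math
from typing import Callable

Kernel = Callable[[float, float], float]


def squared_exponential(lengthscale: float = 1.0, variance: float = 1.0) -> Kernel:
    """Smooth kernel: nearby inputs get highly correlated outputs."""
    def k(x: float, y: float) -> float:
        return variance * math.exp(-((x - y) ** 2) / (2.0 * lengthscale ** 2))
    return k


def periodic(period: float = 1.0, lengthscale: float = 1.0, variance: float = 1.0) -> Kernel:
    """Kernel for functions that repeat exactly with the given period."""
    def k(x: float, y: float) -> float:
        s = math.sin(math.pi * abs(x - y) / period)
        return variance * math.exp(-2.0 * s ** 2 / lengthscale ** 2)
    return k


def add(k1: Kernel, k2: Kernel) -> Kernel:
    """Sum of two kernels: models a sum of independent effects."""
    return lambda x, y: k1(x, y) + k2(x, y)


def multiply(k1: Kernel, k2: Kernel) -> Kernel:
    """Product of two kernels: both notions of similarity must agree."""
    return lambda x, y: k1(x, y) * k2(x, y)


# Periodic structure whose shape drifts slowly over long distances.
locally_periodic = multiply(squared_exponential(lengthscale=5.0), periodic(period=1.0))

print(locally_periodic(0.0, 0.0))
print(locally_periodic(0.0, 1.0))
print(locally_periodic(0.0, 0.5))
```

Points one full period apart stay strongly correlated under the product kernel, while points half a period apart do not.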

ShadowAether @sh.itjust.works 12mo ago

An Intuitive Tutorial to Gaussian Processes Regression

arxiv.org /abs/2009.10862

This tutorial aims to provide an intuitive understanding of Gaussian process regression. Gaussian process regression (GPR) models have been widely used in machine learning applications because of their representation flexibility and inherent uncertainty measures over predictions.
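The "uncertainty measures over predictions" can be made concrete with a minimal one-dimensional GPR sketch in plain Python. It assumes a zero-mean prior, a squared-exponential kernel with unit lengthscale, and a small noise term. The function names and the tiny linear solver are illustrative choices, not taken from the tutorial.

```python
import math


def se_kernel(x, y, lengthscale=1.0, variance=1.0):
    """Squared-exponential covariance between two scalar inputs."""
    return variance * math.exp(-((x - y) ** 2) / (2.0 * lengthscale ** 2))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def gp_predict(x_train, y_train, x_star, noise=1e-6):
    """Posterior mean and variance of a zero-mean GP at a single test input."""
    n = len(x_train)
    # Training covariance with noise added on the diagonal.
    K = [[se_kernel(x_train[i], x_train[j]) + (noise if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    k_star = [se_kernel(x, x_star) for x in x_train]
    alpha = solve(K, y_train)  # K^{-1} y
    mean = sum(k * a for k, a in zip(k_star, alpha))
    v = solve(K, k_star)  # K^{-1} k_star
    var = se_kernel(x_star, x_star) - sum(k * w for k, w in zip(k_star, v))
    return mean, var


x_train = [-1.0, 0.0, 1.0]
y_train = [math.sin(x) for x in x_train]

print(gp_predict(x_train, y_train, 1.0))   # at a training point
print(gp_predict(x_train, y_train, 10.0))  # far from all data
```

Near the training data the predictive variance is close to the noise level, and far from it the variance returns to the prior variance, which is the behavior the tutorial's "uncertainty measures" refers to.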

manitcor @lemmy.intai.tech 12mo ago

Applied Machine Learning (Cornell Tech CS 5787, Fall 2020)

www.youtube.com /playlist

manitcor @lemmy.intai.tech 12mo ago

DeepMind x UCL | Reinforcement Learning Course 2018

www.youtube.com /playlist

Chrüsimüsi @feddit.ch 12mo ago

blog.research.google Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

Large language models (LLMs) are data-efficient, but their size makes them difficult to deploy in real-world scenarios.

"Distilling Step-by-Step" is a new method introduced by Google researchers that enables smaller models to outperform LLMs using less training data. This method extracts natural language rationales from LLMs, which provide intermediate reasoning steps, and uses these rationales to train smaller models more efficiently.

In experiments, the distilling step-by-step method consistently outperformed LLMs and standard training approaches, offering both reduced model size and reduced training data requirements.
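One way to picture training a small model on both labels and LLM-extracted rationales is as a two-task setup: each example yields a label-prediction pair and a rationale-generation pair, and the two losses are summed. The sketch below is a hedged illustration only. The `[label]` and `[rationale]` prefixes, the `Example` type, the loss weight, and the sample data are all assumptions, and no model is actually trained.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Example:
    question: str
    label: str
    rationale: str  # in the method, extracted from an LLM; hard-coded here


def to_multitask_pairs(ex: Example) -> List[Tuple[str, str]]:
    """Turn one example into (input, target) pairs for the two tasks.
    The task prefixes are assumed names, used only for illustration."""
    return [
        ("[label] " + ex.question, ex.label),
        ("[rationale] " + ex.question, ex.rationale),
    ]


def combined_loss(label_loss: float, rationale_loss: float, weight: float = 1.0) -> float:
    """Weighted sum of the two task losses; the weight is a hypothetical knob."""
    return label_loss + weight * rationale_loss


# Made-up example data for illustration.
ex = Example(
    question="Jesse has 3 apples and buys 2 more. How many apples now?",
    label="5",
    rationale="Jesse starts with 3 apples and adds 2, and 3 + 2 = 5.",
)

for source, target in to_multitask_pairs(ex):
    print(source, "->", target)
print(combined_loss(0.5, 1.5, weight=0.5))
```

Because the rationale is only a training target, the small model can be asked for the label alone at inference time under this setup.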

manitcor @lemmy.intai.tech 1y ago

Dr Stephen Wolfram says THIS about ChatGPT, Natural Language and Physics

YouTube Video