Example models implementing MobileNetV2 natively and wrapping torch.hub's mobilenet_v2.
Code example
Example of DeepMind’s Differentiable Neural Computer for partially observable environments.
Code example
Example of how to use Tune’s support for custom training functions to implement custom training workflows.
Code example
Example of how to advance the environment through different phases (tasks) over time.
Code example
How to set up a custom Logger object in RLlib.
Code example
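For instance, a minimal custom Logger might look like the following sketch (based on Tune's `Logger` interface; the `prefix` config key is a made-up assumption):

```python
from ray.tune.logger import Logger

class MyPrintLogger(Logger):
    """Logs results to stdout instead of TensorBoard/JSON/CSV."""

    def _init(self):
        # `self.config` is the trial's config dict; `prefix` is a made-up key.
        self.prefix = self.config.get("logger_config", {}).get("prefix", "")

    def on_result(self, result: dict):
        # Called once per training iteration with the full result dict.
        print(f"{self.prefix} reward={result.get('episode_reward_mean')}")

    def flush(self):
        pass  # nothing buffered
```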
Example of how to output custom training metrics to TensorBoard.
Code example
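As a minimal sketch (assuming the `DefaultCallbacks` API of recent RLlib versions), custom metrics can be emitted from an episode callback and then registered via `config.callbacks(CustomMetricCallbacks)`:

```python
from ray.rllib.algorithms.callbacks import DefaultCallbacks

class CustomMetricCallbacks(DefaultCallbacks):
    def on_episode_end(self, *, worker, base_env, policies, episode, **kwargs):
        # Values written to `episode.custom_metrics` are aggregated
        # (mean/min/max) per training iteration and appear in TensorBoard.
        episode.custom_metrics["episode_len_squared"] = episode.length ** 2
```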
How to set up a custom TFPolicy.
Code example
How to set up a custom TorchPolicy.
Code example
Example of how to use RLlib’s lower-level building blocks to implement a fully customized training workflow.
Code example
Example of how to use the execution plan of an Algorithm to train two different policies in parallel (also using the multi-agent API).
Code example
How to run a custom Ray Tune experiment with RLlib, with custom training and evaluation phases.
Code example
Example of how to write a custom evaluation function that is called instead of the default behavior, which runs n episodes using the evaluation worker set.
Code example
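The hook has roughly the following shape (a minimal sketch assuming single-agent `SampleBatch`es; the returned metric name is an assumption). Pass it via `config.evaluation(custom_evaluation_function=custom_eval_function)`:

```python
def custom_eval_function(algorithm, eval_workers):
    # Draw one batch of experience from each remote evaluation worker
    # (the exact `foreach_worker` signature varies across RLlib versions).
    batches = eval_workers.foreach_worker(lambda w: w.sample(), local_worker=False)
    batches = [b for b in batches if b is not None]
    total_steps = sum(b.count for b in batches) or 1
    mean_step_reward = sum(float(b["rewards"].sum()) for b in batches) / total_steps
    # The returned dict is reported under the "evaluation" results key.
    return {"mean_step_reward": mean_step_reward}
```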
Example showing how the evaluation workers and the “normal” rollout workers can run (to some extent) in parallel to speed up training.
Code example
Example showing how to run an offline RL training job using a historic-data JSON file.
Code example
Example of using Ray Serve to serve RLlib models with an HTTP and JSON interface.
Code example
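The pattern is roughly the following sketch (the checkpoint path and JSON schema are assumptions, not the example's actual values):

```python
from starlette.requests import Request
from ray import serve
from ray.rllib.policy.policy import Policy

@serve.deployment
class ServeRLlibPolicy:
    def __init__(self, checkpoint_path: str):
        # Restore a trained RLlib policy from a (hypothetical) checkpoint.
        self.policy = Policy.from_checkpoint(checkpoint_path)

    async def __call__(self, request: Request) -> dict:
        obs = (await request.json())["observation"]
        action, _, _ = self.policy.compute_single_action(obs)
        return {"action": int(action)}

serve.run(ServeRLlibPolicy.bind("/tmp/rllib_checkpoint"))  # hypothetical path
```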
This script offers a simple workflow for 1) training a policy with RLlib, 2) creating a new policy and restoring its weights from the trained one, and 3) serving the new policy via Ray Serve.
Code example
Example of how to set up n distributed (compiled) Unity3D games in the cloud that act as data-collecting clients against a central RLlib policy server that learns how to play the game.
Code example
Example of online serving of predictions for a simple CartPole policy.
Code example
Example of how to externally generate experience batches in RLlib-compatible format.
Code example
Example of how to find a checkpoint after a Tuner.fit() via custom-defined criteria.
Code example
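A sketch of the idea, assuming a `tune.Tuner` configured elsewhere with checkpointing enabled and the standard RLlib metric names:

```python
# `tuner` is assumed to be a configured ray.tune.Tuner instance.
results = tuner.fit()

# Built-in criterion:
best = results.get_best_result(metric="episode_reward_mean", mode="max")

# Fully custom criterion, e.g. best reward among trials with >= 10 iterations:
eligible = [r for r in results if r.metrics.get("training_iteration", 0) >= 10]
chosen = max(eligible, key=lambda r: r.metrics["episode_reward_mean"])
print(chosen.checkpoint)
```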
Set up RLlib to run any algorithm in (independent) multi-agent mode against a multi-agent environment (see the combined config sketch after the next entry).
Code example
Set up RLlib to run any algorithm in (shared-parameter) multi-agent mode against a multi-agent environment.
Code example
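A minimal sketch covering both modes (the env name and agent IDs 0/1 are assumptions; the env must be a registered MultiAgentEnv):

```python
from ray.rllib.algorithms.ppo import PPOConfig

# Independent learning: one policy per agent.
independent = (
    PPOConfig()
    .environment("my_multi_agent_env")  # hypothetical registered MultiAgentEnv
    .multi_agent(
        policies={"policy_0", "policy_1"},
        policy_mapping_fn=lambda agent_id, *a, **kw: f"policy_{agent_id}",
    )
)

# Shared parameters: every agent maps onto the same policy.
shared = (
    PPOConfig()
    .environment("my_multi_agent_env")
    .multi_agent(
        policies={"shared_policy"},
        policy_mapping_fn=lambda agent_id, *a, **kw: "shared_policy",
    )
)
```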
Example of different heuristic and learned policies competing against each other in rock-paper-scissors.
Code example
Example of the two-step game from the QMIX paper.
Code example
Example of how to use RLlib to learn in PettingZoo multi-agent environments.
Code example
Example of customizing PPO to leverage a centralized value function.
Code example
A simpler method of implementing a centralized critic by augmenting agent observations with global information.
Code example
Example of running a custom hand-coded policy alongside trainable policies.
Code example
Example of how to define weight-sharing layers between two different policies.
Code example
Example of alternating training between DQN and PPO.
Code example
Example of hierarchical training using the multi-agent API.
Code example
Example of an iterated prisoner’s dilemma environment solved by RLlib.
Code example
Example of how to set up fractional GPUs for learning (driver) and environment rollouts (remote workers).
Code example
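A minimal config sketch (the fractions chosen here are illustrative, not recommendations):

```python
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")
    .rollouts(num_rollout_workers=2)
    .resources(
        num_gpus=0.5,              # fraction of a GPU for the learner/driver
        num_gpus_per_worker=0.25,  # fraction of a GPU per rollout worker
    )
)
algo = config.build()
```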
Learning in arbitrarily nested action spaces.
Code example
Example of how to handle variable-length or parametric action spaces.
Code example
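The usual pattern is action masking: the env exposes a mask of currently valid actions alongside the raw observation, and a custom model pushes masked-out logits toward -inf. A sketch (shapes and names are assumptions, assuming gymnasium):

```python
import numpy as np
from gymnasium import spaces

# Observation dict carries a mask of currently valid actions.
observation_space = spaces.Dict({
    "action_mask": spaces.Box(0.0, 1.0, shape=(6,), dtype=np.float32),
    "observations": spaces.Box(-1.0, 1.0, shape=(4,), dtype=np.float32),
})

def mask_logits(logits: np.ndarray, action_mask: np.ndarray) -> np.ndarray:
    # log(0) -> -inf for invalid actions; clip keeps the values finite.
    return logits + np.clip(np.log(action_mask), -1e10, 1e10)
```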
How to filter raw observations coming from the environment for further processing by the Agent’s model(s).
Code example
How to use RLlib’s Repeated space to handle variable length observations.
Code example
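A short sketch of declaring such a space (the item shape and max length are illustrative):

```python
from gymnasium import spaces
from ray.rllib.utils.spaces.repeated import Repeated

# A variable-length list of at most 10 two-dimensional "items" per observation.
observation_space = Repeated(spaces.Box(-1.0, 1.0, shape=(2,)), max_len=10)
```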
Learning with auto-regressive action dependencies (e.g., two action components, where the distribution for the second component depends on the value actually sampled for the first).
Code example
A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence with RLlib-generated baselines.
Code example
Example of training autonomous vehicles with RLlib and CARLA simulator.
Code example
Using Graph Neural Networks and RLlib to train multiple cooperative and adversarial agents to solve the “cover the area” problem, thereby learning how best to communicate (or, in the adversarial case, how to disturb communication).
Code example
A dense-traffic simulation environment with RLlib-generated baselines.
Code example
Example of setting up a multi-agent version of GFootball with RLlib.
Code example
A multi-agent AI research environment inspired by Massively Multiplayer Online (MMO) role-playing games.
Code example
Example of building packet classification trees using RLlib / multi-agent in a bandit-like setting.
Code example
Example of learning optimal LLVM vectorization compiler pragmas for loops in C and C++ code using RLlib.
Code example
Example of using the multi-agent API to model several social dilemma games.
Code example
Create a custom environment and train a single-agent RL algorithm using Ray 2.0 with Tune and AIR.
Code example
Example of training in StarCraft2 maps with RLlib / multi-agent.
Code example
Example of optimizing mixed-autonomy traffic simulations with RLlib / multi-agent.
Code example
Working with custom Keras models in RLlib
Tutorial
Getting Started with RLlib
Video
Deep reinforcement learning at Riot Games
Blog
The Magic of Merlin - Shopify’s New ML Platform
Tutorial
Large Scale Deep Learning Training and Tuning with Ray
Blog
Griffin: How Instacart’s ML Platform Tripled in a year
Video
Predibase - A low-code deep learning platform built for scale
Blog
Building a ML Platform with Kubeflow and Ray on GKE
Video
Ray Summit Panel - ML Platform on Ray
Code example
AutoML for Time Series with Ray
Blog
Highly Available and Scalable Online Applications on Ray at Ant Group
Blog
Ray Forward 2022 Conference: Hyper-scale Ray Application Use Cases
Blog
A new world record on the CloudSort benchmark using Ray
Code example
Speed up your web crawler by parallelizing it with Ray
Tutorial
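The core idea is to turn each fetch into a Ray task so network I/O overlaps. A minimal sketch (the seed URLs are stand-ins):

```python
import urllib.request
import ray

ray.init()

@ray.remote
def fetch(url: str):
    # Each fetch runs as a separate Ray task, so downloads proceed in parallel.
    with urllib.request.urlopen(url, timeout=10) as resp:
        return url, len(resp.read())

urls = ["https://www.ray.io", "https://docs.ray.io"]  # stand-in seed list
print(ray.get([fetch.remote(u) for u in urls]))
```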
Image Classification Batch Inference with Huggingface Vision Transformer
Tutorial
Image Classification Batch Inference with PyTorch ResNet152
Tutorial
Object Detection Batch Inference with PyTorch FasterRCNN_ResNet50
Tutorial
Processing the NYC taxi dataset
Tutorial
Batch Training with Ray Data
Tutorial
Scaling OCR with Ray Data
Code example
Random Data Access (Experimental)
Tutorial
Implementing a Custom Datasource
Code example
Build Batch Prediction Using Ray
Code example
Build a Simple Parameter Server Using Ray
Code example
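The essence is one actor holding the parameters and many tasks pushing updates to it. A minimal sketch with fake gradients (learning rate and dimensions are illustrative):

```python
import random
import ray

ray.init()

@ray.remote
class ParameterServer:
    def __init__(self, dim: int):
        self.params = [0.0] * dim

    def get_params(self):
        return self.params

    def apply_gradients(self, grads):
        # Simple SGD step; 0.1 is an illustrative learning rate.
        self.params = [p - 0.1 * g for p, g in zip(self.params, grads)]

@ray.remote
def worker(ps):
    for _ in range(5):
        params = ray.get(ps.get_params.remote())
        fake_grads = [random.random() for _ in params]  # stand-in gradients
        ray.get(ps.apply_gradients.remote(fake_grads))

ps = ParameterServer.remote(10)
ray.get([worker.remote(ps) for _ in range(4)])
print(ray.get(ps.get_params.remote()))
```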
Simple Parallel Model Selection
Code example
Fault-Tolerant Fairseq Training
Code example
Learning to Play Pong
Code example
Asynchronous Advantage Actor Critic (A3C)
Code example
A Gentle Introduction to Ray Core by Example
Code example
Using Ray for Highly Parallelizable Tasks
Code example
Running a Simple MapReduce Example with Ray Core
Code example
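The shape of such a job, sketched as a word count (input chunks are toy data):

```python
import ray

ray.init()

@ray.remote
def map_fn(chunk: str) -> dict:
    # Map phase: count words in one chunk of the input.
    counts = {}
    for word in chunk.split():
        counts[word] = counts.get(word, 0) + 1
    return counts

@ray.remote
def reduce_fn(*partials: dict) -> dict:
    # Reduce phase: merge the per-chunk counts.
    total = {}
    for counts in partials:
        for word, c in counts.items():
            total[word] = total.get(word, 0) + c
    return total

chunks = ["the quick brown fox", "the lazy dog", "the fox"]
print(ray.get(reduce_fn.remote(*[map_fn.remote(c) for c in chunks])))
```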
Benchmark example for the PyTorch data transfer auto pipeline
Tutorial
How To Use Tune’s Scikit-Learn Adapters?
Code example
Simple example for doing a basic random and grid search.
Code example
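A sketch of combining both in one run (assuming the Ray 2.x Tuner and AIR session APIs; the objective is a toy function):

```python
from ray import tune
from ray.air import session

def objective(config):
    # Toy objective: minimize x^2 + y.
    session.report({"score": config["x"] ** 2 + config["y"]})

tuner = tune.Tuner(
    objective,
    param_space={
        "x": tune.grid_search([-1.0, 0.0, 1.0]),  # grid-searched
        "y": tune.uniform(0.0, 1.0),              # randomly sampled
    },
    # Repeat the grid twice, re-sampling `y` each time.
    tune_config=tune.TuneConfig(num_samples=2),
)
results = tuner.fit()
print(results.get_best_result(metric="score", mode="min").config)
```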
Example of using a simple tuning function with AsyncHyperBandScheduler.
Code example
Example of using a Trainable function with HyperBandScheduler. Also uses the AsyncHyperBandScheduler.
Tutorial
Configuring and running (synchronous) PBT and understanding the underlying algorithm behavior with a simple example.
Tutorial
Example of using the function API with a PopulationBasedTraining scheduler.
Code example
Example of using the Population-based Bandits (PB2) scheduler.
Code example
Example of custom loggers and custom trial directory naming.
Code example
Basics of using Tune.
Code example
Using Search algorithms and Trial Schedulers to optimize your model.
Code example
Using Population-Based Training (PBT).
Code example
Fine-tuning Huggingface Transformers with PBT.
Code example
Logging Tune Runs to Comet ML.
Tutorial
Using Ray Serve to deploy a chatbot
Code example
Fine-tune vicuna-13b-v1.3 with DeepSpeed, PyTorch Lightning and Ray Train