Richard Chen
Instructor credited with teaching the SGLang short course. Relevant as a practitioner translating applied inference techniques into learning material.
Key Highlights
- Richard Chen is credited as the instructor of Andrew Ng’s short course on efficient inference with SGLang.
- His relevance comes from teaching practical methods to reduce redundant LLM costs using caching.
- For AI PMs, his work highlights the growing importance of inference efficiency as a product decision lever.
- Chen is connected to an ecosystem including Andrew Ng, LMSys, RadixArk, and SGLang.
- The available mentions position him as a practitioner translating applied infrastructure techniques into accessible education.
Richard Chen
Overview
Richard Chen is referenced as the instructor for the short course “Efficient Inference with SGLang: Text and Image Generation,” a course unveiled by Andrew Ng and co-built with LMSys and RadixArk. In the available mentions, Chen’s significance comes from his role in teaching practitioners how to apply SGLang’s open-source caching framework to reduce redundant large model inference costs.For AI Product Managers, Richard Chen matters less as a public thought leader with a broad documented profile in this dataset and more as a practitioner-educator associated with turning advanced inference optimization techniques into accessible learning material. His appearance signals an important trend: product teams increasingly need operational fluency in inference efficiency, not just model capability, especially for text and image generation systems where latency and cost directly affect product viability.
Key Developments
- 2026-04-10 — Andrew Ng unveiled the short course “Efficient Inference with SGLang: Text and Image Generation,” co-built with LMSys and RadixArk and taught by Richard Chen.
- 2026-04-10 — Richard Chen was specifically credited with teaching how to use SGLang’s open-source caching framework to cut redundant LLM costs by processing shared prompt components more efficiently.
- 2026-04-10 — Chen’s mention alongside Andrew Ng, LMSys, and RadixArk positioned him within a practical ecosystem focused on applied inference optimization for generative AI workflows.
Relevance to AI PMs
- Translate infra advances into product metrics. Richard Chen’s teaching role centers on inference efficiency, which is directly useful for PMs managing margins, latency SLAs, and usage-based pricing. Techniques like prompt caching can materially change cost per request and user experience.
- Evaluate learning resources that improve team execution. Chen’s involvement suggests the course is geared toward practical implementation, making it relevant for PMs who need engineering and product teams aligned on deployable optimization methods rather than abstract model research.
- Prioritize architecture decisions early. The SGLang context highlights that shared prompt processing and caching are not just engineering details; they shape feature feasibility for high-volume copilots, multimodal generation tools, and enterprise AI products with repetitive context patterns.
Related
- Andrew Ng — Announced the short course taught by Richard Chen, providing the primary context for Chen’s mention.
- SGLang — The inference framework at the center of Chen’s teaching, particularly for caching and efficient text/image generation.
- LMSys — Co-builder of the course, linking Chen to the broader open-source and systems research ecosystem around LLM serving.
- RadixArk — Also co-built the course, connecting Chen to applied infrastructure work focused on efficient inference and deployment.
Newsletter Mentions (3)
“Andrew Ng unveiled a new short course, “Efficient Inference with SGLang: Text and Image Generation,” co-built with LMSys and RadixArk and taught by Richard Chen, teaching how to use SGLang’s open-source caching framework to slash redundant LLM costs by processing shared promp...”
#15 𝕏 Andrew Ng unveiled a new short course, “Efficient Inference with SGLang: Text and Image Generation,” co-built with LMSys and RadixArk and taught by Richard Chen, teaching how to use SGLang’s open-source caching framework to slash redundant LLM costs by processing shared promp...
“Andrew Ng unveiled a new short course, “Efficient Inference with SGLang: Text and Image Generation,” co-built with LMSys and RadixArk and taught by Richard Chen, teaching how to use SGLang’s open-source caching framework to slash redundant LLM costs by processing shared promp...”
#15 𝕏 Andrew Ng unveiled a new short course, “Efficient Inference with SGLang: Text and Image Generation,” co-built with LMSys and RadixArk and taught by Richard Chen, teaching how to use SGLang’s open-source caching framework to slash redundant LLM costs by processing shared promp...
“Andrew Ng unveiled a new short course, “Efficient Inference with SGLang: Text and Image Generation,” co-built with LMSys and RadixArk and taught by Richard Chen, teaching how to use SGLang’s open-source caching framework to slash redundant LLM costs by processing shared promp...”
Andrew Ng unveiled a new short course, “Efficient Inference with SGLang: Text and Image Generation,” co-built with LMSys and RadixArk and taught by Richard Chen, teaching how to use SGLang’s open-source caching framework to slash redundant LLM costs by processing shared promp... #16 𝕏 Santiago : They’ve built a completely new Large Memory Models architecture that mimics human memory instead of using RAG or vector search. The founders—authors of 160+ Nature and ICLR papers—even closed their Harvard lab to focus on it.
Related
AI educator, entrepreneur, and founder known for AI courses and applied machine learning. Here he is credited with a short course on self-evaluating agents.
An open-source inference framework highlighted for high throughput on NVIDIA Blackwell hardware. Useful for AI PMs working on deployment, serving, and latency optimization.
A research organization associated with language model systems and benchmarking. It appears here as a co-builder of an applied short course.
A company or organization co-building an applied AI course with Andrew Ng and LMSys. It is relevant as an ecosystem partner in AI education and tooling.
Stay updated on Richard Chen
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free