Context by Cohere
  • Cohere.ai
  • Docs
  • Pricing
Get Started
Jim Wu

Jim Wu

1 post published

https://jimwu.ca
Cohere Boosts Inference Speed With NVIDIA Triton Inference Server

Cohere Boosts Inference Speed With NVIDIA Triton Inference Server

The adoption of large language models (LLMs) is on the rise. Previously, many natural language processing (NLP) use cases required deploying several different models. With LLMs, one general-purpose model can support a wide variety of NLP use cases, greatly simplifying the integration of language-based machine learning capabilities, such as text

  • Bharat Venkitesh
  • Jim Wu
Bharat Venkitesh, Jim Wu Oct 7, 2022 • 3 min read
Cohere © 2023
  • Cohere.ai
  • Get Started
  • About
  • Classify
  • Generate
  • Responsibility
  • Documentation
  • Careers