MORE POSTS
September 30, 2024
Wrapping up another Birthday Week celebration
Recapping all the big announcements made during 2024’s Birthday Week....
September 26, 2024
Cloudflare’s bigger, better, faster AI platform
Cloudflare helps you build AI applications with fast inference at the edge, optimized AI workflows, and vector database-powered RAG solutions....
July 23, 2024
Meta Llama 3.1 now available on Workers AI
Cloudflare is excited to be a launch partner with Meta to introduce Workers AI support for Llama 3.1...
June 27, 2024
Embedded function calling in Workers AI: easier, smarter, faster
Introducing a new way to do function calling in Workers AI by running function code alongside your inference. Plus, a new @cloudflare/ai-utils package to make getting started as simple as possible...
June 20, 2024
Introducing Stream Generated Captions, powered by Workers AI
With one click, users can now generate video captions effortlessly using Stream’s newest feature: AI-generated captions for on-demand videos and recordings of live streams...
May 22, 2024
AI Gateway is generally available: a unified interface for managing and scaling your generative AI workloads
AI Gateway is an AI ops platform that provides speed, reliability, and observability for your AI applications. With a single line of code, you can unlock powerful features including rate limiting, custom caching, real-time logs, and aggregated analytics across multiple providers...
April 18, 2024
Meta Llama 3 available on Cloudflare Workers AI
We are thrilled to give developers around the world the ability to build AI applications with Meta Llama 3 using Workers AI. We are proud to be a launch partner with Meta for their newest 8B Llama 3 model...
April 02, 2024
Leveling up Workers AI: general availability and more new capabilities
Today, we’re excited to make a series of announcements, including Workers AI, Cloudflare’s inference platform becoming GA and support for fine-tuned models with LoRAs and one-click deploys from HuggingFace. Cloudflare Workers now supports the Python programming language, and more...
April 02, 2024
Running fine-tuned models on Workers AI with LoRAs
Workers AI now supports fine-tuned models using LoRAs. But what is a LoRA and how does it work? In this post, we dive into fine-tuning, LoRAs and even some math to share the details of how it all works under the hood...
March 14, 2024
Mitigating a token-length side-channel attack in our AI products
The Workers AI and AI Gateway team recently collaborated closely with security researchers at Ben Gurion University regarding a report submitted through our Public Bug Bounty program. Through this process, we discovered and fully patched a vulnerability affecting all LLM provider...
March 04, 2024
Cloudflare launches AI Assistant for Security Analytics
Introducing AI Assistant for Security Analytics. Now it is easier than ever to get powerful insights about your web security. Use the new integrated natural language query interface to explore Security Analytics...
February 28, 2024
Unlocking new use cases with 17 new models in Workers AI, including new LLMs, image generation models, and more
In February 2024 we added 8 models for text generation, classification, and code generation use cases. Today, we’re back with 17 more models, focused on enabling new types of tasks and use cases
...
February 06, 2024
Adding new LLMs, text classification and code generation models to the Workers AI catalog
Workers AI is now bigger and better with 8 new models and improved model performance...
December 06, 2023
How we used OpenBMC to support AI inference on GPUs around the world
This is what Cloudflare has been able to do so far with OpenBMC with respect to our GPU-equipped servers...