Runware uses custom hardware and advanced orchestration for fast AI inference

Romain Dillet

1 October 2024 at 0:13 pm·3-min read

Sometimes, a demo is all you need to understand a product. And that’s the case with Runware. If you head over to Runware’s website, enter a prompt and hit enter to generate an image, you’ll be surprised by how quickly Runware generates the image for you — it takes less than a second.

Runware is a newcomer in the AI inference, or generative AI, startup landscape. The company is building its own servers and optimizing the software layer on those servers to remove bottlenecks and improve inference speeds for image generation models. The startup has already secured $3 million in funding from Andreessen Horowitz’s Speedrun, LakeStar's Halo II and Lunar Ventures.

The company doesn’t want to reinvent the wheel. It just wants to make it spin faster. Behind the scenes, Runware manufactures its own servers with as many GPUs as possible on the same motherboard. It has its own custom-made cooling system and manages its own data centers.

When it comes to running AI models on its servers, Runware has optimized the orchestration layer with BIOS and operating system optimizations to improve cold start times. It has developed its own algorithms that allocate interference workloads.

The demo is impressive by itself. Now, the company wants to use all this work in research and development and turn it into a business.

Unlike many GPU hosting companies, Runware isn’t going to rent its GPUs based on GPU time. Instead, it believes companies should be encouraged to speed up workloads. That’s why Runware is offering an image generation API with a traditional cost-per-API-call fee structure. It's based on popular AI models from Flux and Stable Diffusion.

“If you look at Together AI, Replicate, Hugging Face — all of them -- they are selling compute based on GPU time,” co-founder and CEO Flaviu Radulescu told TechCrunch. "If you compare the amount of time it takes for us to make an image versus them. And then you compare the pricing, you will see that we are so much cheaper, so much faster."

“It's going to be impossible for them to match this performance,” he added. "Especially in a cloud provider, you have to run on a virtualized environment, which adds additional delays."

As Runware is looking at the entire inference pipeline, and optimizing hardware and software, the company hopes that it will be able to use GPUs from multiple vendors in the near future. This has been an important endeavor for several startups as Nvidia is the clear leader in the GPU space, which means that Nvidia GPUs tend to be quite expensive.

“Right now, we use just Nvidia GPUs. But this should be an abstraction of the software layer,” Radulescu said. "We can switch a model from GPU memory in and out very, very fast, which allow us to put multiple customers on the same GPUs.

"So we are not like our competitors. They just load a model into the GPU and then the GPU does a very specific type of task. In our case, we've developed this software solution, which allow us to switch a model in the GPU memory as we do inference.“

If AMD and other GPU vendors can create compatibility layers that work with typical AI workloads, Runware is well positioned to build a hybrid cloud that would rely on GPUs from multiple vendors. And that will certainly help if it wants to remain cheaper than competitors at AI inference.

PA Media: Movies
Halyna Hutchins’ mother refuses to attend world premiere of Rust in Poland
Rust will debut at the Camerimage Festival in Poland on Wednesday, three years after Ms Hutchins was shot and killed during production.
Yahoo Movies UK
Will there be a M3GAN 2?
The killer dancing doll is back for a horror sequel in 2025.
PA Media: Movies
Jeremy Renner says he could see left eyeball with right in snowplough accident
The Hollywood star said he could remember his ‘head cracking’ and leg being ‘twisted up like a pretzel’ in the incident.
Yahoo Movies UK
Will there be a Mamma Mia 3?
Everything we know about the long-awaited musical threequel.
Yahoo Movies UK
Vic Flick, the man behind James Bond's legendary guitar riff, dies aged 87
The musician, who played on the Dr No soundtrack with The John Barry Seven in 1962, also backed some of the biggest recording stars in the world.
Yahoo Movies UK
How Wicked connects to The Wizard of Oz timeline
There are many ways the musical movie connects to L. Frank Baum's children's novel and the iconic Judy Garland movie The Wizard of Oz.
Yahoo Movies UK
'Gladiator 2 should have been the gayest blockbuster of 2024'
Paul Mescal has said the sequel isn't just for the bros, it's also for the girls, the gays and everyone in between. But let's be real, it's mostly for the gays.
PA Media: Movies
Wicked stars pay homage to original Broadway production at UK premiere
British stars including Amanda Holden, Leigh-Anne Pinnock and Olly Alexander flocked to the London premiere.
Yahoo Movies UK
2024 is the year horror movies came out on top
From Alien: Romulus to Smile 2 and Longlegs, the crowded space of 2024 horror movies has yielded some enormous box office successes.
Yahoo Movies UK
Gladiator 2 history consultant hits back at inaccuracy claims
History consultant Alexander Mariotti speaks with Yahoo UK about the criticism levelled at the sequel, and calls Ridley Scott 'an artist not a historian'.
Yahoo Movies UK
Why is Wicked being released in two parts?
We’ll soon be whisked off to Oz as stage phenomenon Wicked hits cinemas. Here’s all you need to know about its release date, cast, plot and more.
Yahoo Movies UK
What's happening with Kevin Costner's Horizon movies?
Kevin Costner has pumped a lot of his own cash into his Horizon: An American Saga franchise. But is that enough to save his Western movies?
PA Media: Movies
Paul Mescal speaks out on viral Saoirse Ronan clip: She hit the nail on the head
They appeared on The Graham Norton Show, where Ronan interjected on women’s safety.
Yahoo Movies UK
Ridley Scott brings brother Tony 'with him' in every film, Denzel Washington says
Gladiator II's Denzel Washington speaks to Yahoo UK about working with the late Tony Scott over five films, and reuniting with his brother Ridley Scott.
Yahoo Movies UK
Everything we know about the How to Train Your Dragon remake
The How to Train Your Dragon film franchise is set to get a live-action remake, directed by the filmmaker who made the original animated trilogy.
Yahoo Movies UK
What happened to Jean Purdy? The true story behind Netflix's Joy
James Norton and Thomasin McKenzie star in Joy, Netflix's new biopic about the work of pioneers in fertility treatment including Jean Purdy.
Yahoo Movies UK
When are the 2025 Oscars? What we know as Conan O’Brien named as host
Hollywood’s biggest night gets a sequel, its 97th to be precise. Here's what we know so far about the 2025 Oscars.
PA Media: Movies
Bohemian Rhapsody star Lucy Boynton switches on Bond Street Christmas lights
The festive lights, a tradition for over 60 years, have been illuminated with a Chanel No. 5 installation.
PA Media: Movies
Cynthia Erivo says feeling the ‘odd one out’ drew her to Elphaba role in Wicked
Wicked will be released in the UK on November 22.
PA Media: Movies
Cameron Diaz is Back In Action after decade-long acting hiatus
The film is slated for release on January 17 2025.

Latest stories