TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: 1.32 Petaflops hardware for local prototyping

5 pointsby brody_slade_ai2 months ago
I’ve been working on Vanta, a scalable AI hardware solution powered by 2–8 NVIDIA RTX 4090s, delivering up to 1.32 petaflops FP32 in a compact form factor.<p>It’s built for startups, developers and researchers to prototype, fine-tune and run models up to 70B parameters locally. So you can own your computer instead of renting.<p>- A 2-GPU setup costs $9k and breaks even in 9 months vs. cloud rental at $0.69&#x2F;hr (ex: RunPod).<p>- The 8-GPU at $40k saves $12k in year one compared to $48k in cloud costs.<p>This can handle different AI framework: TensorFlow, PyTorch, ONNX, CUDA-optimized libraries, VLLM, SGLANG, llama.cpp...<p>I can get it built in a day and shipped out quick. Let me know what you think!

2 comments

kristianp2 months ago
I&#x27;m sure it looks better in person, but the images kind of make it look like a wicker basket. Totally superficial take, I know.
ethantom2 months ago
I’m not really sure about this but would love to undestand how can this run multiple different workloads at once? How fast?