TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: Nexa SDK – Build powerful and efficient AI apps on edge devices

27 pointsby frednoodle9 months ago
Hey HN! Alex and Zack here from Nexa AI. We&#x27;re excited to share something we&#x27;ve been working on.<p>Our journey began with the Octopus series --- action models for mobile AI agents (<a href="https:&#x2F;&#x2F;huggingface.co&#x2F;NexaAIDev&#x2F;Octopus-v2" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;NexaAIDev&#x2F;Octopus-v2</a>). We focused on making sub-billion parameter models excel at function calling, making high accurate and fast function-calling possible on mobile and edge devices. But as we delved into developing full-fledged on-device applications, we hit a roadblock.<p>We realized that optimizing for function calling (tool-use) alone wasn&#x27;t enough. Building powerful on-device AI apps requires a diverse set of tools: language models with domain expertise, speech processing, image generation, embedding models and more. That&#x27;s when we decided to create Nexa SDK --- a comprehensive toolkit that brings together everything developers need to build powerful and efficient AI applications that run entirely on-device.<p>Here&#x27;s what Nexa SDK offers:<p><pre><code> - Support for both ONNX and GGML models. - An integrated conversion engine for making custom GGML Quantized Models for different device hardware requirements. - An inference engine that supports language models, image generation models, TTS, audio generation models, and Vision-Language Models. - An OpenAI-compatible API server with optimization in function calling. - A Streamlit UI for rapid prototyping. - An intuitive CLI for easy model management. - Backend optimizations for latency and power consumption on edge devices. </code></pre> We&#x27;ve designed Nexa SDK to be the go-to solution for developers pushing the boundaries of what&#x27;s possible with on-device AI applications and AI on edge devices.<p>To showcase its capabilities, we&#x27;ve built several demo apps running entirely on your device (<a href="https:&#x2F;&#x2F;github.com&#x2F;NexaAI&#x2F;nexa-sdk&#x2F;tree&#x2F;main&#x2F;examples">https:&#x2F;&#x2F;github.com&#x2F;NexaAI&#x2F;nexa-sdk&#x2F;tree&#x2F;main&#x2F;examples</a>):<p><pre><code> - AI soulmate with uncensored model and audio-in&#x2F;audio-out interaction. - A quick interface for uploading and chatting with PDFs like your personal finance documents. - A meeting transcription app supporting multiple languages and real-time translation. </code></pre> We&#x27;re proud to share that the winner of yesterday&#x27;s (Sep 7) House AGI &quot;AI PC&#x2F; GenAI Goes Local&quot; hackathon used Nexa SDK to build a local semantic image search (<a href="https:&#x2F;&#x2F;github.com&#x2F;asl3&#x2F;deja-view">https:&#x2F;&#x2F;github.com&#x2F;asl3&#x2F;deja-view</a>).<p>But we&#x27;re just getting started! There are lots of exciting developments in our pipeline, and we can&#x27;t wait to share them with you soon!<p>Check it out: (<a href="https:&#x2F;&#x2F;github.com&#x2F;NexaAI&#x2F;nexa-sdk">https:&#x2F;&#x2F;github.com&#x2F;NexaAI&#x2F;nexa-sdk</a>)<p>Docs: (<a href="https:&#x2F;&#x2F;docs.nexaai.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;docs.nexaai.com&#x2F;</a>)<p>If you&#x27;re excited about the future of on-device AI, we&#x27;d really appreciate your support. A star on our GitHub repo goes a long way in helping us reach more developers!<p>Cheers,<p>Alex &amp; Zack

2 comments

justfadeaway9 months ago
Super excited, cannot wait to try it!
评论 #41482362 未加载
zhiyuan89 months ago
Any project demos?
评论 #41482372 未加载