Hey everyone,<p>ML Blocks is a node-based workflow builder to create multi-modal AI workflows without writing any code.<p>You connect blocks that call various visual models like GPT4v, Segment Anything, Dino etc. along with basic image processing blocks like resize, invert color, blur, crop, and several others.<p>The idea is to make it easier to deploy multi-step image processing workflows, without needing to spin up endless custom OpenCV cloud functions to glue together AI models. Usually, even if you're using cloud inference servers like Replicate, you still need to write your own image processing code to pre and post-process images in your pipeline. When you're trying to move fast, that's just unnecessary overhead.<p>With ML Blocks, you can build a workflow and deploy the whole thing as a single API. AFAIK, ML Blocks is the only end-to-end workflow builder built specifically for image processing.<p>If you're curious, our models run on Replicate, HuggingFace & Modal Labs cloud GPUs and we use React Flow for the node UX.
I like the idea of node based image editing. It reminds me of when learning how to write shader code for games and GLSL was waaaay too over my head until I discovered node based shaders, and I didn't have to wrangle with coding and instead focusing experimenting with different nodes.<p>However soon creating a "shader that works" was no longer an issue but how to create X effect using shaders was my next blocker, and luckily there were ton of YouTube tutorials on these, which was very helpful, but this continues to be a pain point even now<p>Since now we are in the age of AI, would it be possible to prompt something like "create me a workflow to take image A, a concept art of a character and convert into into a walking animation sprite sheet with 16 frames for each animation walking up, down, left, right and all diagonal directions" and have it not only generate the result, but a workflow to create the result so it can be edited and tweaked.
Interesting! Always interesting to see the ideas in the air at the same time!<p><a href="https://linzumi.com/" rel="nofollow">https://linzumi.com/</a><p>Definitely think this sort of idea could become the "serverless" equivalent for ml-using apps. I'm curious what you think re: versioning, consumption from various client languages, observability/monitoring/queueing, etc.? Feels like it could grow into a meaningful platform.
I had a chance to play around with the product and I really love the ease of creating a multi step workflow, to the point where I'm sure I can train my marketing team to use it. That being said, is there a way to share these workflow with others...either privately or publicly?
This is amazing! Really helps with those of us who are just getting started building workflows for AI images for ecom.
Are there any plans to add bulk processing capabilities, allowing workflows to run on multiple images automatically without manual UI interactions?
Love the idea, however, it's not clear whether I will get access to a large collection of components for building such workflows or what is currently possible? Would nice to get this info before proceeding with auth.
Hey, this is really cool!<p>A small suggestion, I don't think ML is a memorable term for non-technical people. I would prob. try a different name.