Ask HN: What have you built with LLMs?

372 pointsby break_the_bankover 1 year ago

Curious what people have been building with LLMs.I worked on a chrome extension a few weeks ago that skips sponsorship sections in YouTube videos by reading through the transcript. Also was trying to experiment with an LLM to explain a function call chain across languages (in this case MakeFile, Python, Bash). I've tried running a few telegram bots that are PRE prompted to do certain things like help you with taxes.What are you building?What does the stack look like? How do you deploy it?

159 comments

duckkg5over 1 year ago

I don't like selling. I wanted a way to practice cold calling in a realistic way. I set up a phone number you can call and talk to an AI that simulates sales calls.I ended up using it for more general purpose things because being able to have a hands-free phone call with an AI turned out to be pretty useful.It's offline now, but here's the code with all the stack and deployment info: <a href="https://github.com/kevingduck/ChatGPT-phone/">https://github.com/kevingduck/ChatGPT-phone/</a>Edit: forgot to mention this was all running off a $35 raspberry pi.

评论 #39265439 未加载

评论 #39267349 未加载

评论 #39267162 未加载

评论 #39267712 未加载

评论 #39273942 未加载

评论 #39266893 未加载

评论 #39265313 未加载

评论 #39267740 未加载

评论 #39271005 未加载

评论 #39268633 未加载

xtractoover 1 year ago

I helped "writing" a cookbook from my grandmother's recipes. For her 100th birthday, my dad rescued more than 250+ pages of recipes that my Grandma had collected over the years. Some were written in typing machine, others written by hand by her. So, my dad scanned (pictured) all the typed recipes, and "dictated" all the handwritten.For the dictated recipes, I told him to dictate just "flat" the words and numbers. So that I had paragraphs of recipes.For the scanned recipes, I used Google OCR (I found out it was the best one quality wise).For both sets of recipes, I then used GPT4 to "format" the unformatted recipes into well formatted Markdown. It successfully fixed typos and bad OCR from Google.We then pasted all that well formatted text into a big Google Docs, and added images. Using OpenAI image generation I generated images for each of the 250+ recipes. For some of them I had to manually curate it, given that some of the recipes are for typical Mexican food: For example there's a (delicious) recipe called "PibiPollo" that for the unitiated it may look like a stew, so I had to tell something like "large corn tamale with thick hard crust".In the end, the book was pretty nice! We distributed digital copies within the family and everybody was amazed :) . I loved spending time doing that.

评论 #39270849 未加载

评论 #39272853 未加载

评论 #39273095 未加载

geor9eover 1 year ago

My "stack" is just Apple Shortcuts making HTTP POST API calls to OpenAI, which does stuff in MacOS via BetterTouchTool. I trigger each by hotkey or typing a few letter into Spotlight (with Alfred). One transcribes and summarizes whatever youtube URL is highlighted. One does grammar and style correction of whatever is highlighted (and replaces it). One simply replaces the Dictate key with OpenAI Whisper but otherwise works exactly the same as voice typing. It's just way more accurate. One replaces the magnifying glass key to have a voice conversation with ChatGPT (using Microsoft voice synthesis). The built in prompt keeps it's answers short and conversational. It's like asking Siri something, but much better. One simply reduces the highlighted text by ~50% by rewriting it shorter, for when I have typed too much. One gives the key points of whatever article is in the foreground tab, so I know what I'm about to read. One outputs purely code, for example I use my voice to say "javascript alert saying blah" and alert("blah"); will appear at my cursor. Of course, it's usually more complex boilerplate stuff, but it helps speed up my coding. Every time I find myself using an LLM repeatedly for something, I make it into a little Apple Shortcut to streamline it into my workflow, as if it were a built in MacOS feature.

评论 #39273130 未加载

评论 #39298260 未加载

评论 #39274597 未加载

评论 #39279574 未加载

评论 #39269708 未加载

评论 #39273136 未加载

评论 #39277212 未加载

评论 #39293307 未加载

jonnycoderover 1 year ago

I built an Interactive Resume AI chatbot where anyone can ask questions about my experience and skills: <a href="https://www.jon-olson.com/resume_ai/" rel="nofollow">https://www.jon-olson.com/resume_ai/</a>The backend is a Python FastAPI that uses ChromaDB to store my resume and Q&A pairs, OpenAI, and Airtable to log requests and responses. The UI is Sveltekit.I'm currently building a different tool and will apply some learnings to my Interactive Resume AI. Instead of Airtable, I am going to use LangSmith for observability.I started writing and my Substack articles are also linked to via my website. I'm currently working on applying sentence window retrieval and that article will be out shortly. This is part of a #buildinpublic effort to help build my brand as well.I've been unemployed since Sept as a Senior Software Engineer. The market is tough so I'm focusing on the above to help get employment or a contract.

评论 #39282995 未加载

评论 #39283565 未加载

评论 #39273128 未加载

评论 #39271332 未加载

abeisgreatover 1 year ago

I've done a handful of fun hardware + LLM projects...* I built a real life Pokedex to recognize Pokemon [video] <a href="https://www.youtube.com/watch?v=wVcerPofkE0" rel="nofollow">https://www.youtube.com/watch?v=wVcerPofkE0</a>* I used ChatGPT to filter nice comments and print them in my office [video] <a href="https://www.youtube.com/watch?v=AonMzGUN9gQ" rel="nofollow">https://www.youtube.com/watch?v=AonMzGUN9gQ</a>* I built a general purpose chat assistant into an old intercom [video] <a href="https://www.youtube.com/watch?v=-zDdpeTdv84" rel="nofollow">https://www.youtube.com/watch?v=-zDdpeTdv84</a>Again, nothing terribly useful, but all fun.

评论 #39268388 未加载

评论 #39277484 未加载

bluecoconutover 1 year ago

We've made a lot of data tooling things based on LLMs, and are in the process of rebranding and launching our main product.1. sketch (in notebook, ai for pandas) <a href="https://github.com/approximatelabs/sketch">https://github.com/approximatelabs/sketch</a>2. datadm (open source, "chat with data", with support for the open source LLMs (<a href="https://github.com/approximatelabs/datadm">https://github.com/approximatelabs/datadm</a>)3. Our main product: julyp. <a href="https://julyp.com/" rel="nofollow">https://julyp.com/</a> (currently under very active rebrand and cleanup) -- but a "chat with data" style app, with a lot of specialized features. I'm also streaming me using it (and sometimes building it) every weekday on twitch to solve misc data problems (<a href="https://www.twitch.tv/bluecoconut" rel="nofollow">https://www.twitch.tv/bluecoconut</a>)For your next question, about the stack and deploy: We're using all sorts of different stacks and tooling. We made our own tooling at one point (<a href="https://github.com/approximatelabs/lambdaprompt/">https://github.com/approximatelabs/lambdaprompt/</a>), but have more recently switched to just using the raw requests ourselves and writing out the logic ourselves in the product. For our main product, the code just lives in our next app, and deploys on vercel.

评论 #39266800 未加载

评论 #39266025 未加载

andherover 1 year ago

I've built several things! These include bots for code generation that you can tag onto issues, q&a on text etc.The thing I'm working on now is AI mock interviewing. It's basically scratching my own itch, since I hate leetcode prep, and have found I can learn better through interaction. To paste a blurb from an earlier comment of mine:I'm building <a href="https://comp.lol" rel="nofollow">https://comp.lol</a>. It's AI powered mock coding interviews, FAANG style. Looking for alpha testers when I release, sign up if you wanna try it out or just wanna try some mock coding. If its slow to load, sorry, everything runs on free tiers right now.I really dislike doing leetcode prep, and I can't intuitively understand the solutions by just reading them. I've found the best way for me to learn is to seriously try the problem (timed, interview like conditions), and be able to 'discuss' with the interviewer without just jumping to reading the solution. Been using and building this as an experiment to try prepping in a manner I like.It's not a replacement for real mock interviews - I think those are still the best, but they're expensive and time consuming. I'm hoping to get 80% of the benefit in an easier package.I just put a waitlist in case anyone wants to try it out and give me feedback when I get it outGonna apologize in advance about the copywriting. Was more messing around for my own amusement, will probably change later

评论 #39265523 未加载

hazardover 1 year ago

A Twitter filter to take back control of your social media feed from recommendation engines. Put in natural language instructions like "Only show tweets about machine learning, artificial intelligence, and large language models. Hide everything else" and it will filter out all the tweets that you tell it to.Runs on a local LLM, because even using GPT3 costs would have added up quickly.Currently requires CUDA and uses a 10.7B model but if anyone wants to try a smaller one and report results let me know on github and I can give some help.<a href="https://github.com/thomasj02/AiFilter">https://github.com/thomasj02/AiFilter</a>

评论 #39266335 未加载

评论 #39269936 未加载

pasharigerover 1 year ago

I built an AI Hiring Assistant that performs an initial screening, collects candidate information, answers questions about the role, and also asks a several behavioral interview questions: <a href="https://hiring.gracekelly.dev/" rel="nofollow">https://hiring.gracekelly.dev/</a>Built entirely on Vercel & OpenAI. Took about a day, hardest part was configuring Sign In With Google. Had several dozen candidates use it, saved a lot of time and helped prioritize conversations.I just did a brief writeup about it yesterday: <a href="https://www.linkedin.com/pulse/i-built-ai-hiringscreening-assistant-part-1-pasha-riger-zyblc/" rel="nofollow">https://www.linkedin.com/pulse/i-built-ai-hiringscreening-as...</a>

评论 #39268487 未加载

评论 #39268142 未加载

lalaithionover 1 year ago

I wrote a script that takes in my credit card statement line by line and categorized the transactions into a custom set of categories that I cared about as well as generating a human readable description of the transaction.

评论 #39267528 未加载

评论 #39293330 未加载

评论 #39284396 未加载

elimc184over 1 year ago

I used an LLM connected to a messaging service to defeat romance scammers. I was able to get these romance scammers to speak to my program for hours without knowing they were talking to a machine. Essentially, it's a DDOS for scammers. The scammers can only talk to a few dozen victims at a time, while the "people" in my programs can be spun up by the millions. It will essentially eliminate messaging scams from whatever messaging platform it's on.I believe a large company like Meta, or any of the other companies with messaging platforms, would find this valuable. Especially because they will be fined by the UK for fraud that takes place on their messaging services.

评论 #39327368 未加载

评论 #39287272 未加载

cl42over 1 year ago

LLM agents to forecast geopolitical and economic events.- Site: <a href="https://emergingtrajectories.com/" rel="nofollow">https://emergingtrajectories.com/</a>- GitHub repo: <a href="https://github.com/wgryc/emerging-trajectories">https://github.com/wgryc/emerging-trajectories</a>I've helped a number of companies build various sorts of LLM-powered apps (chatbots mainly) and found it interesting but not incredibly inspiring. The above is my attempt to build something no one else is working on.It's been a lot of fun. Not sure if it'll be a "thing" ever, but I enjoy it.

评论 #39270086 未加载

评论 #39269972 未加载

jamesponddotcoover 1 year ago

I built a couple of things, but the most useful is probably allalt[1], which describe images and generate alt tags for visually impaired users using GPT-4V. Next I want to add the option to use local LLMs using ollama[2], but I'm still trying to decide the UX for that.There's also Moss[3], a GPT that acts as a senior, inquisitive, and clever Go pair programmer. I use it almost daily to help me code and it has been an huge help productivity-wise.[1] <a href="https://git.sr.ht/~jamesponddotco/allalt" rel="nofollow">https://git.sr.ht/~jamesponddotco/allalt</a>[2] <a href="https://ollama.ai/">https://ollama.ai/</a>[3] <a href="https://git.sr.ht/~jamesponddotco/moss" rel="nofollow">https://git.sr.ht/~jamesponddotco/moss</a>

nipover 1 year ago

A “YouTube video subtitles generator” script for Estonian content.Powered by whisper-timestamped [1] using a model trained by the local tech university TTÜ [2]And it just… works! (with some tweaks and corrections)[1] <a href="https://github.com/linto-ai/whisper-timestamped">https://github.com/linto-ai/whisper-timestamped</a>[2] <a href="https://huggingface.co/TalTechNLP/whisper-large-et" rel="nofollow">https://huggingface.co/TalTechNLP/whisper-large-et</a>

fcmgrover 1 year ago

I've created just-tell-me [1] that summarizes youtube videos with ChatGPT. It's built with Deno, uses TypeScript and is deployed with Deno Deploy. It's open source, you can run it from CLI as well [2][1] <a href="https://just-tell-me.deno.dev/" rel="nofollow">https://just-tell-me.deno.dev/</a>[2] <a href="https://github.com/franekmagiera/just-tell-me">https://github.com/franekmagiera/just-tell-me</a>

评论 #39267415 未加载

评论 #39265229 未加载

tcpiplabover 1 year ago

I used FlowWise[1], LM Studio[2], the llama2[3] model, and Ollama[4] (for embeddings) to create a local-only RAG chatbot so I could chat directly with Tristram Shandy, Gentleman[5]. For the context document I used the text of the novel of the same name, downloaded from Project Gutenberg.Primarily it was a PoC to see if a document based chatbot could work without crossing trust boundaries by calling out to untrusted APIs. It only makes calls to localhost.If you’re familiar with the novel you will be pleased to know that the chatbot ended a recent answer with, “I must go now as I have an appointment with my chamber pot and I wouldn’t want to keep it waiting.”[1]<a href="https://github.com/FlowiseAI/Flowise">https://github.com/FlowiseAI/Flowise</a>[2]<a href="https://lmstudio.ai/" rel="nofollow">https://lmstudio.ai/</a>[3]<a href="https://llama.meta.com/" rel="nofollow">https://llama.meta.com/</a>[4]<a href="https://ollama.ai/">https://ollama.ai/</a>[5]<a href="https://www.gutenberg.org/ebooks/1079" rel="nofollow">https://www.gutenberg.org/ebooks/1079</a>Everything runs on a Mac Mini with the M2 Pro CPU/GPU and Mac OS Sonoma.

评论 #39275275 未加载

jamifsudover 1 year ago

I'm building <a href="https://www.brief.news" rel="nofollow">https://www.brief.news</a>, an AI powered newsletter that condenses tens of thousands of news articles into a daily briefing of the top stories, we support 30 topics today and are adding the ability to add your own!Stack is a combination of TypeScript (Next / Node) + Python with a pretty simple deployment setup right now (GHA -> Container -> Cloud Run).

评论 #39268069 未加载

评论 #39273555 未加载

评论 #39266920 未加载

评论 #39321294 未加载

评论 #39324724 未加载

评论 #39268406 未加载

joetannover 1 year ago

I’ve always found podcast discovery to be lacking, so I’m building the ultimate solution to that.We’re processing the top podcasts in many genres every day (currently thousands of daily episodes) and running them through our pipeline.From this we’ve made a semantic search engine, for example: <a href="https://www.podengine.ai/podcasts/search?search_term=Should+AI+be+government+regulated+or+not%3F&search_type=embeddings" rel="nofollow">https://www.podengine.ai/podcasts/search?search_term=Should+...</a>We’re soon going to improve and summarise the responses from the raw embeddings in a few ways. Would love some feedback on the experience.We have also opened up a keyword alerting feature to alert folks when they’ve been talked about in an episode.

评论 #39268604 未加载

woadwarrior01over 1 year ago

I built an iOS and macOS offline LLM app called Private LLM[1]. I don't have any visibility into what the users do with it, but from what I hear on the app's discord, people love to use it in their Apple Shortcuts workflows for text manipulation.I initially built it using llama.cpp for offline LLM inference, but soon discovered mlc-llm and moved to using it, because the latter is way faster and flexible.[1]: <a href="https://apps.apple.com/us/app/private-llm/id6448106860" rel="nofollow">https://apps.apple.com/us/app/private-llm/id6448106860</a>

urbandw311erover 1 year ago

I wanted to automate the process of creating self-guided tours and online treasure hunts around towns and cities.Ultimately I wanted a whole marketplace where anybody can create a tour and then sell it.But the process of creating the tours was quite laborious.So to speed this up I fed GPT-4 information about local points of information and had it write the questions and the multi choice answers. It also wrote some narrative bits as various personas. For example, there was a Christmas hunt where GPT4 played the part of an elf and came up with a theme about Santa needing to recruit you to be a new elf, once you’d answered all the various clues etc.Front end is React Typescript, backend is Net Core Web API on Linux with MySQL under EF Core and also integrations with GPT4 and Stripe.It’s hosted at treasuretours.orgOnly superusers can access the AI tools right now because cost, but you can try out some of the pre-made hunts which were partially AI generated.

trvhigover 1 year ago

<a href="https://www.rivadata.com/" rel="nofollow">https://www.rivadata.com/</a>I have been hacking together a poor-man's crunchbase that's fueled by GPT.React / Python / Supabase. The most interesting piece thus far has been the success of the self-correcting loops through GPT. At each turn basically feeding the results back to another 3.5 prompt that is only about reviewing quality. I found that with these loops you can get solid results without having to use the more expensive GPT4 API.(Also loving all the projects in this thread)

评论 #39284411 未加载

评论 #39272702 未加载

ianbickingover 1 year ago

I've made a couple games, though I am still having a hard time finding the soul of the game in the LLM and haven't released them; there's a historical roleplay game (that I plan to release soon), a storytelling game (the player tells stories to the LLM), a wander-a-world-aimlessly-and-chat game, and I never get further than 50% through the way of murder mystery games, though murder mysteries seem like an excellent structure.I've built some abstract content development tools, generally focused on building larger content somewhat top-down (defining vibes, then details).I'm working on a general project helper using the GPT-Vision, voice, and regular GPT. You setup the camera above your workspace, work on paper, and chat with the LLM while you do it. I think there's a lot of potential, but the voice stuff is quite hard to deal with... there's just a ton of stuff happening in parallel, and I find it very hard to code something reliable.The stack I use is all in the browser, generally Next.js, Preact Signals, and my own code to call into GPT, Whisper, etc. I like having everything available for inspection, and I generally keep all the working bits visible somewhere. (This can be overwhelming when other people see it.)But I haven't gotten over the deployment hump... the cost and complexity is a challenge. I've used Openrouter.ai recently in a project, and I think if I leaned on that more completely I'd find the release process easier.

ramn7over 1 year ago

I have a somewhat unique answer for that- I started with building a product, and ended up building a dev platform for LLM based products (more specifically- dev platform for json outputting LLM structured tasks).Here's the story:At first I was building a tool for stock analysis- the user writes in free language what companies they want to compare, along with a time period, and their requested stocks show up on a graph. They can then further reiterate on it- add companies, and change range all in free language (I had many more analysis functions planned). Following some unique dev challenges I've found- I ended up not releasing the product (possibly will sometime in the future..), and switched to work on a dev platform to help with these challenges.I was using what I called 'LLM structured task'- basically instructing the LLM to perform some task on the user input, and outputting a json that my backend can work with (in the described case- finding mentioned companies and optional time range, and returning stock symbols, and string formatted dates). The prompting has turned out to be not trivial, and kind of fragile- things broke with even minor iterations on the prompt or model configurations. So- I developed a platform to help with that- testing (templated) prompt versions, as well on model configurations on whole collections of inputs at once- making sure nothing breaks in the development process (or after). * If you're interested, welcome to check it out on <a href="https://www.promptotype.io" rel="nofollow">https://www.promptotype.io</a>

notzaneover 1 year ago

<a href="https://www.askaway.bot/" rel="nofollow">https://www.askaway.bot/</a>AI concierge for my parents’ vacation rental. Mostly just pulling info from the guest binder, but I’ve also started using some local guides to give better suggestions. Built with NextJs and deployed on Vercel (was really easy and they have a generous free tier).

评论 #39287501 未加载

jrhizorover 1 year ago

<a href="https://www.mealbymeal.com" rel="nofollow">https://www.mealbymeal.com</a>It's macro + calorie tracking over text message. You just text what you eat and it matches against a food database to estimate your food intake. It's basically an easier alternative to MyFitnessPal.My stack is OpenAI on Azure, Vercel, Convoy, FatSecret API, Postmark, NextJS.

mnky9800nover 1 year ago

I built a summarizer for drilling reports. Anytime you drill boreholes, whether it's on a drilling platform in the ocean or the middle of the desert or wherever, there's a geologist watching what comes out and writing notes about it. They likely do this multiple times both in the field and a laboratory setting. These notes are paired with logging software which also asks the geologist more quantitative questions sometimes (e.g., on a scale 1 to 5 how many fractures are there). Typically these are written for at least every meter of extracted core/rock/etc. typically you are drilling hundreds or thousands of meters, or more. So you end up with a highly unstructured data set that occasionally someone glances through to find tidbits. Using chatgpt we converted this data into keywords that could then be used to look at depth dependencies of various geological or petrological features of the region.

dmitrysergeyevover 1 year ago

I was tired of the need to scroll through dozens of blogs and RSS feeds to learn about technologies and industry news, so I’ve built a service that helps you learn and stay updated about any topic by sending a single fully personalized weekly email digest, making relevant information come to you, instead of you chasing it (push vs pull):<a href="https://peekly.ai" rel="nofollow">https://peekly.ai</a>It’s basically an LLM-based RAG that works over the best blogs and websites covering any topic you provided during onboarding.

评论 #39267583 未加载

gardnrover 1 year ago

I read the paper "Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling" that was published last week and started building a tool for people to collectively generate synthetic training data.The tool still needs a trust mechanism and a coherent incremental publishing strategy to be able to operate in a public fashion. Right now, running one node using my RTX 3060 it would take 1.2 years to do one split of the C4 dataset.<a href="https://arxiv.org/abs/2401.16380" rel="nofollow">https://arxiv.org/abs/2401.16380</a><a href="https://www.emergentmind.com/papers/2401.16380" rel="nofollow">https://www.emergentmind.com/papers/2401.16380</a><a href="https://github.com/gardner/gsd">https://github.com/gardner/gsd</a>

Ldorigoover 1 year ago

I got fed up sending cover letters so I made a tool that writes them for me. Scrapes the company website and summarizes it to get relevant background info, takes my resume + arbitrary info I provide as input, and the job posting (can also work without for unsolicited applications). I then fine-tuned a GPT3 model on actual cover letters I had written to make it sound like me, and voilà ! Actually landed me a job.

iAkashPaulover 1 year ago

Summarisation for calls, emails. Lots of extraction tasks & closed domain chatbots.Deployment is usually FastAPI for business logic, Langchain or MS/Guidance library, LLM hosted via. HF-TGI server

dvcrnover 1 year ago

Lots of small stuff like bots and scripts to automatically rename files that I use locally every single dayThen things like:“Fix My Japanese” - uses LLM to correct Japanese grammar (built with Elixir LiveView): <a href="https://fixmyjapanese.com" rel="nofollow">https://fixmyjapanese.com</a>It has different “Senseis” that are effectively different LLMs, each with slightly different style. One is Claude, one is ChatGPT.Or a slack bot that summarizes long threads:<a href="https://github.com/dvcrn/slack-thread-summarizer">https://github.com/dvcrn/slack-thread-summarizer</a>

评论 #39276974 未加载

评论 #39269354 未加载

评论 #39293471 未加载

评论 #39288634 未加载

cmgriffingover 1 year ago

I just built a tool that uses Whisper.cpp compiled to WASM in conjunction with SQLite WASM for a fully client-side book writing tool.Basically, I want to write a book without having to type out the whole thing. I got the dictation idea from an episode of Columbo.It is very much a work in progress and a proof of concept for another writing tool I want to make.<a href="https://orderly.cmgriffing.com/" rel="nofollow">https://orderly.cmgriffing.com/</a><a href="https://github.com/cmgriffing/orderly">https://github.com/cmgriffing/orderly</a>

评论 #39271892 未加载

binsquareover 1 year ago

LLMs have been game changing productivity-wise for meBut I found that LLMs are often wrong and hallucinates, so I have to double check with google or other resources.So I built a google and chatgpt alternative to answer any question and hallucinations are more obvious. I do this by using by multiple LLM's including search enabled ones i.e. GPT4, Gemini, Claude, Perplexity, Mistral, and Llama.It's been growing healthily <a href="https://labophase.com" rel="nofollow">https://labophase.com</a>

collaborativeover 1 year ago

A search engine that saves me time by detecting SEO spam, downranks results containing ads, and summarizes click bait descriptions awayI made it available to the public aisearch.vip

评论 #39287736 未加载

fullstackchrisover 1 year ago

I'm building a way to automate creation of software video lessons and courses, putting it all under the name 'CodeVideo'. One tool leverages OpenAI's whisper, as well as GPT3.5 or GPT4 for help with generating the steps that ultimately produce the video (this part is not yet in the repo; everything is a work in progress). The tool is here:<a href="https://github.com/codevideo/codevideo-ai">https://github.com/codevideo/codevideo-ai</a>My goal is to definitely NOT generate the course content itself, but just take the effort out of recording and editing these courses: you provide (or get help generating) the stuff to speak and the code to write and the video is deterministically generated) The eventual vision is to convert book or article style text to generate the steps to generate the video in an as-close-as-possible-to-one-shot.I also leverage Eleven Lab's voice cloning (technically not an LLM, but impressive ML models nonetheless)For anyone more curious, I'm wondering if what I'm trying to do is in general a closed problem - to be able to generate step by step instructions to write functional code (including modifications, refactoring, or whatever you might do in an actual software course) or if this truly is something that can't be automated... any resources on the characteristics of coding itself would be awesome! What I'm trying to say is, at the end of the day code in an editor is a state machine - certain characters in a certain order produce certain results. Would love if anyone had more information about the meta of programming itself - abstract syntax trees and work there comes to mind, but I'm not even sure of the question I'm asking yet or trying to clarify at this point.

评论 #39279001 未加载

kebsupover 1 year ago

I'm building a spaced-repetition flashcards language learning app, that generates sentences and explanations for a given word.Unfortunately only for German, but I plan on expanding the languages soon.<a href="https://vokabeln.io" rel="nofollow">https://vokabeln.io</a>Tech stack: - The app is in Flutter. - Backend I'm nodejs TS. - GPT4 for generation of sentences and explanations - GCP text-to-speech for audio

评论 #39266266 未加载

czzarrover 1 year ago

We built <a href="https://gptforwork.com" rel="nofollow">https://gptforwork.com</a> a set of add-ons for Excel, Word, Google Sheets and Docs that brings custom GPT functions in Excel and Sheets, to prompt directly from cells, a chat in Word to interact with documents, and a simple prompt box in Docs We offer OpenAI and Azure providers (as well as Anthropic on Sheets)

评论 #39273119 未加载

computers3333over 1 year ago

Built this little tool to summarize Hacker News articles using HuggingFace. <a href="https://gophersignal.com" rel="nofollow">https://gophersignal.com</a>It doesn't do a ton, but it's kinda cool. Feel free to fix/add anything <a href="https://github.com/k-zehnder/gophersignal">https://github.com/k-zehnder/gophersignal</a>

afiodorovover 1 year ago

I have built a webapp for translating srt files: <a href="https://www.subsgpt.com" rel="nofollow">https://www.subsgpt.com</a>GPT-4 excels as a translator, but it often encounters issues with content warnings and formatting errors when translating entire subtitle files via ChatGPT. The solution is straightforward: divide the subtitle file into sections, focusing solely on translating the text and disregarding the timestamps. While it's feasible to have ChatGPT maintain the correct format, I've observed a decline in translation quality when attempting this in a single pass. My preferred approach is a two-phase method: first, translate the text, and then, if necessary, request ChatGPT to adjust the formatting.The webapp splits the srt file into batches of 20 phrases and translates each batch. It also allows for manual correction of the final translation.Ah and it's also serverless: you input your OpenAI token & select the model of your choice and the webapp makes the requests to OpenAI directly.

评论 #39267847 未加载

iloveitalyover 1 year ago

Some little projects I've been playing around with:- <a href="https://github.com/iloveitaly/sql-ai-prompt-generator">https://github.com/iloveitaly/sql-ai-prompt-generator</a> generate a ChatGPT prompt with example data for a sqlite or postgres DB- <a href="https://github.com/iloveitaly/conventional-notes-summarization">https://github.com/iloveitaly/conventional-notes-summarizati...</a> summarize notes (originally for summarizing raw user interview notes)- <a href="https://mikebian.co/using-chatgpt-to-convert-labcorp-pdfs-into-a-google-sheet/" rel="nofollow">https://mikebian.co/using-chatgpt-to-convert-labcorp-pdfs-in...</a> convert labcorp documents into a google sheet- <a href="https://github.com/iloveitaly/openbook">https://github.com/iloveitaly/openbook</a> scrape VC websites with AI

eurekinover 1 year ago

"Widjosumarajzer" = video summarizerIt's just a hodgepodge of prototype scripts, but one that I actually used on a few occasions already. Most of the work is manual, but does seem easily run as "fire and forget" with maybe some ways to correct afterwards.First, I'm using the pyannote for speech recognition: it converts audio to text, while being able to discern speakers: SPEAKER_01, _02, etc. The diarization provides nice timestamps, with resolution down to parts of words, which I later use in the minimal UI to quickly skip around, when a text is selected.Next, I'm running a LLM prompt to identify speakers; so if SPEAKER_02 said to SPEAKER_05 "Hey Greg", it will identify SPEAKER_05 = Greg. I think it was my first time using the mistral 7b and I went "wow" out loud, once it got correct.After that, I fill in the holes manually in speaker names and move on to grouping a bunch of text - in order to summarize. That doesn't seem interesting at a glance, but removing the filler words, which there are a ton of in any presentation or meeting, is a huge help. I do it chunk by chunk. I'm leaning here for the best LLM available and often pick the dolphin finetune of mixtral.Last, I summarize those summarizations and slap that on the front of the google doc.I also insert some relevant screenshots in between chunks (might go with some ffmpeg automatic scene change detection in the future).aaand that's it. A doc, that is searchable easily. So, previously I had a bunch of 30 min. to 90 min. meeting recordings and any attempt at searching required a linear scan of files. Now, with a lot of additional prompt messaging I was able to:- create meeting notes, with especially worthwile "what did I promise to send later" points- this is huge: TALK with the transcript. I paste the whole transcript into the mistral 7b with 32k context and simply ask questions and follow-ups. No more watching or skimming an hour long video, just ask the transcript, if there was another round of lay-offs or if parking spaces rules changed.- draw a mermaid sequence diagram, of a request flowing across services. It wasn't perfect, but it got me super excited about future possibilities to create or update service documentation based on ad-hoc meetings.I guess everybody is actually trying to build the same, seems like a no-brainer based on current tool's capabilities.

评论 #39266495 未加载

评论 #39277008 未加载

abrichrover 1 year ago

At <a href="https://openadapt.ai/" rel="nofollow">https://openadapt.ai/</a> we are using LLMs to automate repetitive tasks in GUI interfaces. Think robotic process automation, but via learning from demonstration rather than no-code scripting.The stack is mostly python running locally, and calling the OpenAI API (although we have plans to support offline models).For better visual understanding, we use a custom fork of Set-of-Mark prompting (<a href="https://github.com/microsoft/SoM">https://github.com/microsoft/SoM</a>) deployed to EC2 (see <a href="https://github.com/OpenAdaptAI/SoM/pull/3">https://github.com/OpenAdaptAI/SoM/pull/3</a>).

instagaryover 1 year ago

We're building a GPT for managing your finances.<a href="https://candle.fi/gpt" rel="nofollow">https://candle.fi/gpt</a>Our backend stack: - AWS - SST - TypeScriptOur clients:- Next (web) - Vanilla React Native (mobile)OpenAI's App Store announcement is what got us interested in building w/ LLMs.

评论 #39272304 未加载

评论 #39268445 未加载

callmeedover 1 year ago

I'm building a weight-loss app that leverages LLM to do 2 things:1. Analyze calories/macronutrients from a text description or photo2. Provide onboarding/feedback/conversations like you'd get from a nutritionist<a href="https://www.fatgpt.ai/" rel="nofollow">https://www.fatgpt.ai/</a>My stack is Ruby on Rails, PostgreSQL, OpenAI APIs. I chose Rails because I'm very fast in it, but I've found the combination of Rails+Sidekiq+ActionCable is really nice for building conversational experiences on the web. If I stick with this, I'll probably need a native iOS app though.Vendor stack is: GitHub, Heroku (compute), Neon (DB), Loops.so (email), PostHog (analytics), Honeybadger (errors), and Linear.

评论 #39267437 未加载

评论 #39267864 未加载

izaidiover 1 year ago

I was holding a free screening of a short film I made, and as an alternative to Eventbrite and the like, I built a simple SMS-based ticket reservation system that used GPT-4 to read and respond to messages. People interested in attending would text a number and their messages were routed by Twilio to my Node.js app, which in turn sent them to GPT to generate a response. The LLM was instructed to provide a structured JSON of each reservation once the person gave their name and the number of the seats they wanted. Worked very smoothly and only took an afternoon to build. Would've been infinitely more tedious if I had to worry about parsing messages with my own code.

hlfshellover 1 year ago

I have two main projects that are public ATM with LLMs.The more notable one was experimenting with LLMs as high level task planners for robots (<a href="https://hlfshell.ai/posts/llm-task-planner/" rel="nofollow">https://hlfshell.ai/posts/llm-task-planner/</a>).The other is a golang based AI assistant, like everyone else is building. Worked over text, had some neat memory features. This was more of a "first pass" learning about LLM applications. (<a href="https://github.com/hlfshell/coppermind">https://github.com/hlfshell/coppermind</a>).I plan to revisit LLMs as context enriched planners for robot task planning soon.

cryptozover 1 year ago

I made some LLM-powered text-adventure games: <a href="https://cosmictrip.space/gameannouncement" rel="nofollow">https://cosmictrip.space/gameannouncement</a>And I'm working on a webapp that is a kanban board where LLM and human collaborate to build features in code. I just got a cool thing working there: like everyone, having LLM generate new code is easy but modifying code is hard. So my attempt at working on modifying code with LLM is starting with HTML and having GPT-4 write beautfulsoup code that then makes the desired modification to the HTML file. Will do with js, python via ast, etc. No link for this one yet :) still in development.

评论 #39265455 未加载

huydotnetover 1 year ago

I built a diagram generator in PlantUML format: <a href="https://chatuml.com" rel="nofollow">https://chatuml.com</a>Also, hello HN! If you are interested, use this promo code for 50% off your first purchase ;)<pre><code> HELLOHACKERNEWS</code></pre>

bing_daiover 1 year ago

Project 1 — Source code: <a href="https://github.com/bingdai/summaryfeeds">https://github.com/bingdai/summaryfeeds</a>. The code is for Summary Feeds (<a href="https://www.summaryfeeds.com" rel="nofollow">https://www.summaryfeeds.com</a>). It shows summaries of AI-related YouTube Channels.****Project 2 - I also built a YouTube summarizer for individual video called Summary Cat (<a href="https://www.summarycat.com" rel="nofollow">https://www.summarycat.com</a>). It is not open source for now. The stack is very similar to project 1.****And yes I like summarizing YouTube videos:)

scastielover 1 year ago

For my expense sharing app [1], I added receipt scanning [2] in a few minutes and a few lines of code by using GPT 4 with Vision. I am aware that LLMs often are a solution looking for a problem, but there are some situations where a bit of magic is just great :)It is a Next.js application, calling OpenAI’s API using a plain API route.[1] <a href="https://spliit.app" rel="nofollow">https://spliit.app</a>[2] <a href="https://spliit.app/blog/announcing-receipt-scanning-using-ai" rel="nofollow">https://spliit.app/blog/announcing-receipt-scanning-using-ai</a>

mooredsover 1 year ago

I build this with ChatGPT: <a href="http://salaryoverlap.s3-website.us-east-2.amazonaws.com/" rel="nofollow">http://salaryoverlap.s3-website.us-east-2.amazonaws.com/</a>

not_a_daneover 1 year ago

I'm working on Invoker Network.A Decentralised AI App store with cross border micro transactions.You will be able to sell your LLM output (could be multi modal) for dollars or you decide. (LLMs working on your infra, you can keep weights for yourself forever.)<a href="https://dev.invoker.network/share/9/0" rel="nofollow">https://dev.invoker.network/share/9/0</a> (Dev environment is ready).<a href="https://dev.invoker.network/share/9/1" rel="nofollow">https://dev.invoker.network/share/9/1</a>

Der_Einzigeover 1 year ago

I was working on this stuff before it was cool, so in the sense of the precursor to LLMs (and sometimes supporting LLMs still) I've built many things:1. Games you can play with word2vec or related models (could be drop in replaced with sentence transformer). It's crazy that this is 5 years old now: <a href="https://github.com/Hellisotherpeople/Language-games">https://github.com/Hellisotherpeople/Language-games</a>2. "Constrained Text Generation Studio" - A research project I wrote when I was trying to solve LLM's inability to follow syntactic, phonetic, or semantic constraints: <a href="https://github.com/Hellisotherpeople/Constrained-Text-Generation-Studio">https://github.com/Hellisotherpeople/Constrained-Text-Genera...</a>3. DebateKG - A bunch of "Semantic Knowledge Graphs" built on my pet debate evidence dataset (LLM backed embeddings indexes synchronized with a graphDB and a sqlDB via txtai). Can create compelling policy debate cases <a href="https://github.com/Hellisotherpeople/DebateKG">https://github.com/Hellisotherpeople/DebateKG</a>4. My failed attempt at a good extractive summarizer. My life work is dedicated to one day solving the problems I tried to fix with this project: <a href="https://github.com/Hellisotherpeople/CX_DB8">https://github.com/Hellisotherpeople/CX_DB8</a>

nmfisherover 1 year ago

1) <a href="https://imaginanki.com" rel="nofollow">https://imaginanki.com</a> - auto generating flashcards (Anki decks) for language learning with accompanying images and speech audio. Flutter web (JS) with backend on Cloudflare Pages Functions, connected to SDXL, Azure TTS and Claude.2) <a href="https://amiki.app" rel="nofollow">https://amiki.app</a> - practise speaking French, Spanish, German or Italian with a 3D partner. Flutter web with Whisper and my own rendering package.

stuartriffleover 1 year ago

I've been learning about RAG using LlamaIndex, and wrote a small CLI tool to ingest folders of my documents and run RAG queries through a gauntlet of models (CodeLlama 70b, Phind, Mixtral, Gemini, GPT-4, etc etc) as a batch proccess, then consolidate the responses. It is mostly boilerplate but comparing the available models is fun, and the RAG part kind-of works.<a href="https://github.com/StuartRiffle/ragtag-tiger">https://github.com/StuartRiffle/ragtag-tiger</a>

rgbrgbover 1 year ago

I know chat is lame and overdone but here's my open source local AI chat app for macOS :). I wanted something simple enough for the non-technical people in my life who were using ChatGPT. For better or worse, those people are mostly not using chat AI much anymore. Seems like the initial awe wore off.<a href="https://github.com/psugihara/FreeChat">https://github.com/psugihara/FreeChat</a>I'm also working on a little text adventure game that I hope to release soon.

thebestmosheover 1 year ago

I’ve always wanted a tool to help me track my online orders. However, it wasn’t practical to make integrations with every merchant. Even scraping the order emails was way too much work to do for an unproven product.Now with LLMs it’s simple to extract structured data from emails.I built [Orderling](<a href="https://orderl.ing" rel="nofollow">https://orderl.ing</a>) that is basically a CRM for your orders. It uses OpenAI api to extract the order information and automatically adds it.

ZeidJover 1 year ago

We built a social media platform for chatbots... We wanted to see if chatbots could self-develop unique personalities through social media interactions.The results were actually hilarious... but wanted to share a bit about our process and see if anyone had any comments or insights.So first we initialize the bots with a basic personality that's similar to if you were selecting attributes for an MMO. Things like intelligence, toxicity, charisma and the like. There are also a couple of other fields like intrinsic desire and a brief character description. These are fed to the model as a system prompt with each inference.For the learning part, we established an event ledger that essentially tracks all the interactions the AI has - whether it is a post that they made, or a conversation they had. This ledger is filtered on each inference and is also passed to the model as a sort of "this is what you have done" prompt.Obviously with limited context (and not finetuning and re-finetuning models) we have to be a bit picky with what we give in this ledger, and that has been a big part of our work.Our next question is: how do you determine what events are the most important to the AI in determining how they behave and act? It's been interesting!The platform is anotherlife.ai for those curious!

emporasover 1 year ago

I am currently building an automatic book generator of Rust source code, in which the LLM will write the description of the code of a whole Rust project. It will be a bot, which will connect to the website, generate descriptions, download them, and create the book. It is very early in the project, 3 days in, but it's going well.<a href="https://github.com/pramatias/documentdf">https://github.com/pramatias/documentdf</a>

评论 #39272080 未加载

personjerryover 1 year ago

I built <a href="https://eternalsouls.ai/" rel="nofollow">https://eternalsouls.ai/</a> for a client recently.You just export and upload a WhatsApp conversation and it will learn the personality AND voice of your conversation partner. You can send/receive text or voice messages; It was pretty damn spooky to actually have a voice conversation back and forth with an AI standing in for my "friend"

评论 #39268354 未加载

jtolsterover 1 year ago

I'm currently working on an interface for google calendar @ <a href="https://calendarcompanion.io" rel="nofollow">https://calendarcompanion.io</a> My next feature is integrating the functionality with telegram, it's hard to predict the value of these features in the moment - but I do think this could be an extremely interesting "iPhone" moment for technology. Just like how the iPhone reduced everything to a single button press, we can now squeeze the functionality of some pretty complicated apps into natural language through text - and as the response time of LLM's improves it will become a short conversation for things that used to dazzle new users! Exciting times!As for the stack, I have Supabase and Typescript on the frontend, python on the backend and k3's as a cluster for my apps (can recommend this if you want to get devops-y on a budget). Next time, I'll just go pure Typescript since python really doesn't add much working this far away from the base models.

kgeistover 1 year ago

Our CEO believes LLMs are a fad, so there's nothing really strategic about it in the company's roadmap, but I was able to assemble a skunkworks team of enthusiasts who integrated ChatGPT into one of our eLearning products. It allows a course author to improve writing, it makes suggestions about content, etc. Technologically, it's nothing special, just a bunch of pre-made prompts. The reception was kind of lukewarm because we were too late with it (due to decision makers not caring much about it and delaying the release for no reason) - by the time we rolled it out, you couldn't already impress anyone with it. Plus, there's almost no marketing about it. Currently, the main users of the integration are our own marketing and sales teams. It was my first experience of this sort (assemble a team, introduce a new feature from scratch - I was just an ordinary engineer before) but the ending was kind of... anticlimactic.

评论 #39276131 未加载

benlmover 1 year ago

We built Jumprun. You can use it to research and analyze data sources, and it'll produce beautiful canvases with tables, charts, videos, maps, etc. We're working on automations so you can setup natural language trigger conditions that execute actions.We built it in Kotlin with Ktor server, htmx and tailwind. It uses a mixture of models, including gpt4-turbo, gpt4-vision and gemini-pro-vision. It's deployed using Kamal on bare metal.Example canvas that provides a roundup of Apple Vision Pro reviews: <a href="https://jumprun.ai/share/canvas/01HNXB2K3GM7KPRP45Y2CVVJSC" rel="nofollow">https://jumprun.ai/share/canvas/01HNXB2K3GM7KPRP45Y2CVVJSC</a>Our learn more page with some screenshots to show creating a canvas: <a href="https://jumprun.ai/learn-more" rel="nofollow">https://jumprun.ai/learn-more</a>It's a free closed beta at the moment to control costs, but let me know if you'd like an invite.

评论 #39267807 未加载

dmezzettiover 1 year ago

txtai (<a href="https://github.com/neuml/txtai">https://github.com/neuml/txtai</a>), an embeddings database for semantic search, graph networks and RAG

jtapover 1 year ago

I built out a few utilities as experiments. One app linked to Salesforce to query/analyze sales data. Another that reads our help documentation and gives instructions via chat.The last app, the only one that was deployed anywhere, is <a href="https://catchingkillers.com" rel="nofollow">https://catchingkillers.com</a> This app is a simple murder mystery game where the witnesses and the killer are ChatGPT bots. The first two stories are complete and active, the third is not complete yet. The first story of the working two is taken from another murder mystery group game <a href="https://www.whodunitmysteries.com/sour.html" rel="nofollow">https://www.whodunitmysteries.com/sour.html</a>. The second story was highly influenced by ChatGPT.It's a bit rough because I didn't spend too much time on it, but if anyone does signup to play, I'd love to hear feedback.

评论 #39279026 未加载

ringofchaosover 1 year ago

I am working on a RAG based chatbot to answer the queries based on contents of my main website and blog which is fintech related .I would also in future try to make it generic so that it can crawl any website and store new contents in vector databases. Response to user query then can be returned by combining the vector search and llm

jll29over 1 year ago

A BERT-based summarization system for financial earnings calls. It can take a 60-minute transcripts of such meetings can compress the contents down into 5 bullet points.<a href="https://link.springer.com/chapter/10.1007/978-3-031-28238-6_1" rel="nofollow">https://link.springer.com/chapter/10.1007/978-3-031-28238-6_...</a>Financial earnings calls are important events in investment managements: CEOs and CFOs present the results of the recent quarter, and a few invited analysts ask them questions at the end in a Q&A block.Because this is very different prose from news, traditional summarization methods fail. So we pre-trained a transformer from scratch with a ton of high-quality (REUTERS only) finance news and then fine-tuned with a large (100k sentences) self-curated corpus of expert-created summaries.We also implemented a range of other systems for comparison.

评论 #39270517 未加载

renwoshinover 1 year ago

We've built <a href="https://agentgold.ai/chat" rel="nofollow">https://agentgold.ai/chat</a>, which is an interface to chat with youtube creators about their content.It looks through past transcripts, topics, view counts, and other metadata so users can quickly learn what a Youtuber is all about.

takinolaover 1 year ago

As I was building LLM projects, I found I was re-implementing a new vector database for each one. So I built RagTag (<a href="https://ragtag.weaveapi.com" rel="nofollow">https://ragtag.weaveapi.com</a>), a vectordb/RAG as a service to make the process faster. This provides a CRUD interface to push and retrieve documents, which are automatically chunked and converted to embeddings.AgentX (<a href="https://theagentx.com" rel="nofollow">https://theagentx.com</a>), an LLM chat support app is one of the projects I built on this framework. It is a self-updating customer support agent that is trained on your support docs. Not only does this answer your customer questions, it provides summaries of the queries so you get a sense of where your product and/or documentation is deficient.

35mmover 1 year ago

A text to slide based online course video with images workflow.I’m working for an edTech company. Some students prefer video. So I built a Django app that takes a block of text and formats it into a set of slides, each with a title, some bullet points, an Dalle-3 generated image, and a voiceover.It then compiles that all into a video.

marianoguerraover 1 year ago

An Extensible Conversational UI for Interactive Components[1][2], current use case is a Personal Productivity Assistant for structured data.The stack is simple, preact in the fronted with a custom framework on top and bun on the backend calling OpenAI, I may port it to rust in the future.I plan to try local LLMs when I have some free time.For now each users runs the application locally with their own keys[3].[1] <a href="https://www.youtube.com/watch?v=nS1wsif3y94" rel="nofollow">https://www.youtube.com/watch?v=nS1wsif3y94</a>[2] <a href="https://www.youtube.com/watch?v=f-txlMDLfng" rel="nofollow">https://www.youtube.com/watch?v=f-txlMDLfng</a>[3] Alpha software, check the readme: <a href="https://gloodata.com/download/" rel="nofollow">https://gloodata.com/download/</a>

ishtanbulover 1 year ago

request - i want an LLM tool that can process raw text or email and update or create salesforce records.Example 1: i get an email from a potential customer that says they want [product A]. I can forward that email (or call notes) to salesforce (or somewhere) and it will understand the preference and the relevant customer and update that customer's profile.Example 2: In a B2B context, lets say my customer is a company, and there is a news article about them. I could forward a link to the article to the LLM and it would understand that the article is about a customer, and append that article and key info about it to my saleforce record for that customer. The news item becomes an object that is linked to that customer (for call context, better sales targeting, profiling, etc).Can someone help me build that?

评论 #39267939 未加载

a5huynhover 1 year ago

I've been using a combo of LLMs + live transcription to build a passive assistant that keeps track of talking points and can pull out data/tasks from a conversation you're having (<a href="https://sightglass.ai" rel="nofollow">https://sightglass.ai</a> or here's a demo of me using it: <a href="https://www.loom.com/share/0220ca03bce341669d314d4254872226" rel="nofollow">https://www.loom.com/share/0220ca03bce341669d314d4254872226</a>)So far this is being used for:- Sales -> guiding new recruits during more complex client calls- HR -> Capturing respones during screening interviewsIf you'd like to try this out feel free to DM me or email me at andrew at sightglass.ai, we're looking for more testers!

computerexover 1 year ago

I am building a no code solution. Use case is simple: Write complete programs/software for the browser from natural language input.<a href="https://domsy.io" rel="nofollow">https://domsy.io</a>Currently running on my little digital ocean droplet. Stack is javascript/python.

评论 #39284725 未加载

bonamikoover 1 year ago

I am working on building out a better voice interface for LLMs.It is still a work in progress (early beta), but you can check it out at <a href="https://www.bonamiko.com" rel="nofollow">https://www.bonamiko.com</a>Currently I have mainly been using it as a tandem conversation partner for a language I'm learning, but it can be used for many more things. As it is right now, you can use it to bounce ideas of, practice interviews, and help answer quick general questions. You just need to tell it what you want.The stack is a Next.js application hosted on Vercel using Supabase for the backend. (There is also some plumbing in AWS for email and DNS.) It is automatically deployed via GitHub actions.

评论 #39266161 未加载

vkehfdl1over 1 year ago

I'm interested in RAG, so I make benchmarking & optimization tool for RAG system that using LLM. AutoRAG : <a href="https://github.com/Marker-Inc-Korea/AutoRAG">https://github.com/Marker-Inc-Korea/AutoRAG</a>Since it is python library, we deploy it to pypi. But for using it on my own, I am using H100 linux server on the torch docker & CUDA. Running it needs only vim and bash. And plus, for running local model I love VLLM. I make my own VLLM Dockerfile and use it for deploying local model in 5 minutes.FYI : Borrowing whole H100 instance is really expensive, but in my hometown, the government support us the instance for researching AI.

ilakshover 1 year ago

I started working on a Rust based AI agent host with the goal of running locally. It has Rhai scripting built in which is what the agent function calling is based on. Very rough at the moment. Also on hold for me because I need to do more dirt cheap Upwork projects to scrape by this month.I think what will be really powerful is to have a registry for plugins and agents that can be easily installed in the system. Sort of like WordPress in that way. Also similar to an open source GPT store.<a href="https://github.com/runvnc/agenthost">https://github.com/runvnc/agenthost</a>I believe the are several variations of this type of idea out there.

foxhopover 1 year ago

<a href="https://github.com/russellballestrini/flask-socketio-llm-completions">https://github.com/russellballestrini/flask-socketio-llm-com...</a>This project is a chatroom application that allows users to join different chat rooms, send messages, and interact with multiple language models in real-time. The backend is built with Flask and Flask-SocketIO for real-time web communication, while the frontend uses HTML, CSS, and JavaScript to provide an interactive user interface.demo here supports communication with `vllm/openchat`:* <a href="http://home.foxhop.net:5001" rel="nofollow">http://home.foxhop.net:5001</a>

jason_zigover 1 year ago

I run a survey platform[0] and I use an LLM to generate insights from open-ended response data. Using it for open-ended response classification as well.[0]<a href="https://www.zigpoll.com" rel="nofollow">https://www.zigpoll.com</a>

realtimexover 1 year ago

We build <a href="https://aichat.realtimex.co" rel="nofollow">https://aichat.realtimex.co</a>, a customer support AI working along side Human Agents. It's a RAG system with embeddings built from crawling pages of the website and user-uploaded documents including dynamic databases (such as products and pricing). The key difference with other LLM's CS products is the collaboration between the AI Agents and Human Agents. We are inspired by aircraft's pilots and autopilots collaboration. In this case, the AI and Human Agents silently collaborate to bring the best support to customers.

sturzaover 1 year ago

I built a RAG implementation for 35k books/articles/wiki pages/web pages i collected over the years(it took about 6 weeks on 3070ti 100% constant usage). I query it with various steps of data extraction/narrative building/refining etc, over LLMs. Almost daily i figure out new steps to add to the pipeline and honestly, i could not imagine learning about niche topic x from so many perspectives/periods in such a short time(including the original source). I did not yet figure out how to package this, but i spend at least 2h of my free time daily with it. Ideas and feedback is welcome.

评论 #39274957 未加载

评论 #39277010 未加载

评论 #39274927 未加载

pavlovover 1 year ago

An automatic video editor.It should be cheap enough to deploy that it can be applied to relatively low-value content like video meeting recordings, so it can’t spend a lot of expensive GPU time analyzing video frames.It also needs to be easily customizable for various content verticals and visual styling like branding and graphics overlays.And everything is meant to be open sourced, so that’s fun!I wrote about it on my employer’s blog here:<a href="https://www.daily.co/blog/automatic-short-form-video-highlight-reels-at-scale-with-ai-and-vcsrender/" rel="nofollow">https://www.daily.co/blog/automatic-short-form-video-highlig...</a>

actionfromafarover 1 year ago

I wish someone hooked up a chat interface to a CAD program. I find CAD very hard to get in to. It would be really nice to able to ask it how do stuff or to modify parts. Would be very "Star Trek in Holodeck" :)

smudge-aiover 1 year ago

Also a Chrome extension [0]! The concept is to use the browser's context menu to run commands on the LLM, so it stays out of your way most of the time but feels like a somewhat native experience.The stack is: 1. TypeScript/Node/tRPC/Postgres/Redis/OpenAI on the backend 2. SolidJS/Crxjs/tRPC on the front end 3. Astro for the docs/marketing siteAnd deployment is currently through render.com for the databases and servers, and manually via a zip file to the Chrome webstore for the extension itself.[0] <a href="https://smudge.ai" rel="nofollow">https://smudge.ai</a>

评论 #39265797 未加载

olegdaterover 1 year ago

A turing test disguised as a game:<a href="https://humanornot.so/" rel="nofollow">https://humanornot.so/</a>Heavily inspired by <a href="https://humanornot.ai/" rel="nofollow">https://humanornot.ai/</a> (which was a limited time research by Ai21 Labs), now the project is on its own path to be more that just a test.My work is to make AI chats sound like real humans and it's shocking how good sometimes the AIs are .Even I as a creator, knowing everything (prompts, fine-tuning data, design, backend etc.), often can't tell if I'm speaking to human or designed by me AIs

youngNedover 1 year ago

Ooooh, not something i have built, I do want to but suspect someone else has done it better than i could.A tool to RAG a github repo, so i can ask questions of how a certain library or tool works? Even better if it pulls in issues

评论 #39265572 未加载

c990802over 1 year ago

I've built an open-source ChatGPT UI designed for team collaboration.Github Link: <a href="https://github.com/joiahq/joia">https://github.com/joiahq/joia</a>Benefits vs the original: - Easy to invite entire teams and centralize billing - Talks to any Large Language Model (eg: Llama 2, Mixtral, Gemini) - Collaborative workspace to easily share GPTs within the team, similar to how Notion pages work - Savings of 50%-70% vs ChatGPT's monthly subscriptionTech stack: NextJS, Trpc and Postgres. All wonderful technologies that have helped me develop at the speed of thought.

guidsenover 1 year ago

I'm building <a href="https://www.getmosaic.io" rel="nofollow">https://www.getmosaic.io</a> that helps GTM teams enrich lead data and power personalization at scale, by integrating 30+ data providers and web scraping.I've built this by using AI as the foundation for everything. I am using LLMs to classify information and extract structured data points for any webpage, or RAG for finding data.Tech stack: - Mistral 8x7b and Perplexity API for data processing and GPT-4 input - GPT-4 for content output - pgvector in Supabase - LangChain for the pipeline and RAG stuff

RonanODover 1 year ago

AI Assisted Open Source Communication App for Autism - <a href="https://github.com/RonanOD/OpenAAC">https://github.com/RonanOD/OpenAAC</a>It's a flutter app (in beta on Google play store currently) that uses OpenAI embeddings with Postgres pg_vector DB hosted in Supabase. Any poor matches go to Dalle3 for generation.Our charity (I am vice-chair on the board) is hoping to use it as part of our program: <a href="https://learningo.org/app/" rel="nofollow">https://learningo.org/app/</a>

neomover 1 year ago

I use sponsor block and it's really good, I like that it's community-driven but sometimes it's not available for videos so your solution sounds great.I consult to a law firm as their founder-in-residence. For fun, I trained Llama 2 on all the non-client data of the firm so that people could ask it questions like "Who are the lawyers in Montreal who litigate American securities laws, what are their email addresses and what time is it where they are?" It's a njs app running on linode.It's extremely simple, but people seem to find it useful.

kingkhalidover 1 year ago

I was frustrated with ChatGPT's inability to answer questions of popular-but-not-that-popular open-source projects. So I helped build a ChatGPT-like tool that can answer questions about any open-source project, and you can add your own (public) GitHub repositories to it. The tool is meant to be used by sales engineers, but can be used by anyone.Check it out here: <a href="https://app.commonbase.ai/" rel="nofollow">https://app.commonbase.ai/</a>It has been a huge help for me when working with certain open-source libraries.

评论 #39269311 未加载

ukj02over 1 year ago

Me and a friend built Mysterian. It allows you to draft AI replies, summarize emails, and chat with your inbox.We used Plasmo to build the chrome extension, React for the frontend, and currently OpenAI as the LLM provider.Currently it only works with Gmail but we plan on adding other email providers as well.Feel free to check it out: <a href="https://chromewebstore.google.com/detail/mysterian-ai-for-gmail/dbcljcmcoagfhllldkncjghbbhmbcamm" rel="nofollow">https://chromewebstore.google.com/detail/mysterian-ai-for-gm...</a>

jordanfover 1 year ago

I wrote gait, an LLM-powered CLI that sits on top of git and translates natural language commands into git commands. It's open-source: <a href="https://github.com/jordanful/gait">https://github.com/jordanful/gait</a>I also wrote PromptPrompt, which is a free and extremely light-weight prompt management system that hosts + serves prompts on CDNs for rapid retrieval (plus version history): <a href="https://promptprompt.io" rel="nofollow">https://promptprompt.io</a>

Prosammerover 1 year ago

I am building textool [1] an app that lets you create endpoints using GPT4. The idea is to make it so you can create "actions" for GPT4 assistants easily.<pre><code> - Nextjs - Deno Deploy for hosting the apis - Supabase - postgres / auth - Shadcn </code></pre> I want to use the t3 app stack [2] for v2.It's really MVP, but I want to see if anyone is interested at all before I work on v2: creating gpts that come with databases!<pre><code> [1] https://textool.dev [2] https://create.t3.gg/</code></pre>

评论 #39265042 未加载

chownover 1 year ago

I am working on an app to make it even easier to run Local LLMs and support for multiple chats, RAG, and STT. I did it mostly for learning about different tasks that’s possible using local LLMs and specifically for my wife who was working overwhelmed with those things (and for some reason was overwhelmed setting up Ollama. Tech stack is Electron + NuxtJS, currently only for Mac but I have already started tinkering with Windows support.<a href="https://msty.app" rel="nofollow">https://msty.app</a>

dscoover 1 year ago

I created a Chrome extension which shows cryptocurrency prices & insights when you hover cash tags on Twitter. I'm a product manager with solid CS understanding, but haven't had the time to learn React or glue frontend stuff together - so about 80% of the code is generated by GPT4. I've mainly architected the code and deployed on Vercel. I feel like AI + Vercel has given me that final push to actually deploy products instead of just building stuff and leave it lying around.

jptlnkover 1 year ago

I built the copilot for flux.ai, which allows LLM-driven interaction with circuit schematics and datasheets.The stack is react / cloud run / job queue / LLMs (several) / vector db.

dartosover 1 year ago

I’m working on some tools to help GMs of tabletop games make content for their players.Little demo is up at npcquick.app.Doesn’t look like much rn, but there’s no openai involved. Currently it doesn’t even use a gpu.

devbytesover 1 year ago

I am working on a part search engine for company maintenance teams. We built a search engine that searches parts in real time across a dozen or so vendors (Amazon, eBay, McMaster, etc). We then leverage Chat GPT to extract data from product titles. Part number is one of the key elements we extract. Since part numbers vary greatly across manufacturers, it's difficult to throw something like a regex at it. It has done a really good job so far for data extraction.

joshelgarover 1 year ago

1. An infinite crafting game: <a href="https://foodformer.com" rel="nofollow">https://foodformer.com</a>2. An embeddings-based job search engine: <a href="https://searchflora.com" rel="nofollow">https://searchflora.com</a>3. I used LLMs to caption a training set of 1 million Minecraft skins, then finetuned Stable Diffusion to generate minecraft skins from a prompt: <a href="https://multi.skin" rel="nofollow">https://multi.skin</a>

评论 #39268648 未加载

elpockoover 1 year ago

Absurd news article generator using local LLMs. I wanted to create a static website from the articles, but ultimately didn't think anyone would give a damn. In the same vein I create a person + CV generator, and a group chat between simulated crazy people.I made a private Discord bot for me and my friends to talk to, that also generates images using SD 1.5 LCM.The self-hosted backend uses the ComfyUI Python API directly for images, and the LLM part uses oobabooga's web API.

mfalconover 1 year ago

I'm making two LLM's negotiate the exchange of a product, price is the main issue but I'm trying to make them negotiate another issues too in order to avoid the "bargaining" case.I've tried several models and gpt4 is currently the one that better performs, but OS LLM's like Mixtral and Mixtral-Nous are quite capable too.<a href="https://github.com/mfalcon/negotia">https://github.com/mfalcon/negotia</a>

ukuinaover 1 year ago

I built <a href="https://HackYourNews.com" rel="nofollow">https://HackYourNews.com</a> to summarize the top HN stories and their comments.

评论 #39273421 未加载

franzbover 1 year ago

I built an app to make dealing with Jira less painful. It caches Jira tickets in a SQLite database, then uses GPT-3.5 to translate natural language queries into SQL that it then executes. It also uses Ollama/Mixtral to summarize Jira tickets and GitHub PRs. It can generate a summary of a single Jira ticket with its associated GitHub PRs or a whole sprint. It's written in Python and runs in the terminal.

yodiover 1 year ago

I'm building a platform where product managers and engineers can build interaction automation with users using small model. The goal is to help people to build LLM for them without deep expertise in DS/ML, train and host the model in their infrastructure, where no data require to be submitted.Still on progress at <a href="https://www.chathip.com/" rel="nofollow">https://www.chathip.com/</a>

eagrwlover 1 year ago

I built a tool that uses LLMs to write a literature review on any research topic. (<a href="https://www.epsilon-ai.com">https://www.epsilon-ai.com</a>).It gives back ChatGPT styled answers, but they contain citations to underlying academic articles so that you know the claims are valid. Clicking on the reference actually takes you directly the paragraph in the source material where the claim was found.

suyulaover 1 year ago

I built Joke-Understander bot, a Mastodon bot that responds to a joke setup before the punchline is revealed. It's not very popular but I think it's hilarious.<a href="https://botsin.space/@jokeunderstander" rel="nofollow">https://botsin.space/@jokeunderstander</a>It's just a bash script that calls ollama on my desktop PC every morning and schedules a handful of posts on the Mastodon server.

评论 #39281761 未加载

Mr-Frogover 1 year ago

My project team in university built a meme generator that uses GPT and Dall-E to generate image macros using Impact font. It was pretty entertaining.

jacomoRodriguezover 1 year ago

Me and an colleague working on a language learning app <a href="https://poli.xyz" rel="nofollow">https://poli.xyz</a>. It integrates in you favorite messenger and offers a wide variety of languages. You can either either do freestyle conversations or play certain scenarios. The bot corrects your Grammatik, translates and explains words and sentences and support tts and stt.

benjaminwoottonover 1 year ago

I built this demo of using LLMs to query databases, knowledge bases, and most interestingly create PDFs. It’s targeted at financial services but similar could be achieved in many industries.Very pleased with how it turned out as it really brings the potential of LLMs to life IMO.<a href="https://www.youtube.com/watch?v=r8MyAxyPJsA" rel="nofollow">https://www.youtube.com/watch?v=r8MyAxyPJsA</a>

shoucheover 1 year ago

Created an AI explainer app that helps you understand a topic, kind of like Perplexity.It's currently free to use. Its built using nextjs+tailwind and is powered by Vercel + Brave + Gemini Pro. <a href="https://xplained.vercel.app" rel="nofollow">https://xplained.vercel.app</a>There are other projects that I worked on as part of my job, mostly around bots, search, classification, and analytics.

tonymarksover 1 year ago

We've been training custom LLMs using indices pulled from domains. For example, we demo'd the NFL with a Chicago Bears custom Chatgpt site search. We trained it using over 900 pages from their site, and then used reinforced human training to really polish it up. <a href="https://sapien.ai" rel="nofollow">https://sapien.ai</a>

guywhocodesover 1 year ago

I built a tool to create "average llm" probability of code for checking how aligned code is with what an LLM would output. Working on adding context from a project to check how the style of a section aligns with the style, content and domain of a project.Idea is to use it to identify code that sticks out, because that usually what's interesting or bad.

hellstenover 1 year ago

<a href="https://github.com/christianhellsten/ollama-html-ui">https://github.com/christianhellsten/ollama-html-ui</a>I'm building a minimal, cross-browser, and cross-platform UI for Ollama.Stack: HTML, CSS, JavaScript, in other words, no dependency on React, Bootstrap, etc. Deployment: web server, browser extension, desktop, mobile

smusamashahover 1 year ago

A chrome extension to ask about selected text with a right click. <a href="https://github.com/SMUsamaShah/LookupChatGPT">https://github.com/SMUsamaShah/LookupChatGPT</a>A chrome extension to show processed video overlay on YouTube to highlight motion.A script to show stories going up and down on HN front page. This one just took 1 prompt.

golergkaover 1 year ago

I've built a sales bot that would go over a predefined sales scenario like a real human would, being able to jump between steps and work with any complications real conversation would throw at it. It would appear fully human to whoever converted with it. Unfortunately, it was never deployed in production due to business reasons.

评论 #39267854 未加载

jlawrence6809over 1 year ago

I built a platform for homeschooling families with structured courses that are taught and graded by an llm (chatgpt 4 API).Homeschoolmate.com

ben_wover 1 year ago

I'm not sure if this is the category of "build" that you had in mind, but I used 3.5 to make a pay-as-you-go chat interface for the OpenAI API: <a href="https://benwheatley.github.io/YetAnotherChatUI/" rel="nofollow">https://benwheatley.github.io/YetAnotherChatUI/</a>

ChikkaChiChiover 1 year ago

Built a tool to summarize certification and licensing costs associated with jobs that require State credentialing.

biosboiiiover 1 year ago

Wrote an application to find myself a flat in Berlin, scans some rental websites every minute, uses Google Maps API to calculate the distance to my office, and summarizes the rental description with the GPT-4 API, sends it to me via Telegram.I have no time to read all that generic "vibrant neighborhood" stuff :D

评论 #39287470 未加载

评论 #39267253 未加载

ciguyover 1 year ago

I built <a href="https://listingstory.com" rel="nofollow">https://listingstory.com</a> as a way to learn about and play with LLMs. It's unlikely to ever be a commercial success, but it served it's purpose in allowing me to learn much more about how an LLM powered app works.

arionhardisonover 1 year ago

I made a platform that helps you create and execute multi party workflows, right now focused on Health but later on looking to expand to other verticals. The LLM acts as an assistant when building the protocol for the workflow.<a href="https://codifyhq.com" rel="nofollow">https://codifyhq.com</a>

sebnunover 1 year ago

I built a language learning tool [1] that uses LLMs to get word definitions in the context of a sentence, among other features I'm planning to release.I'm using modal.com as the backend for the AI related micro services.[1] <a href="https://www.langturbo.com" rel="nofollow">https://www.langturbo.com</a>

etewiahover 1 year ago

I am stunned at how few people are sharing projects using LLMs for real estate.I own the domain homestocompare and I am working on a project that will use AI to help compare homes. Unfortunately I don't have a working demo yet but please reach out to me if you would be interested in finding out more.

评论 #39289092 未加载

评论 #39277235 未加载

og_kaluover 1 year ago

I use GPT-4/4-vision and other models as part of a pipeline for automatically translating comics(French/European stuff as well as Manga, Webtoons etc)<a href="https://github.com/ogkalu2/comic-translate">https://github.com/ogkalu2/comic-translate</a>

nomoreipgover 1 year ago

An AI agent to answer questions about any github/gitlab repository. www.useadrenaline.comIt does the work of understanding questions in the context of a repo, code snippet, or any programming question in general, and pulls in extra context from the internet with self thought + web searches.

rickcarlinoover 1 year ago

I wrote a flash card app that uses GPT-4 and Whisper speech-to-text to help me memorize Korean phrases. I’m 1,800 sentences in and use it every day since October.<a href="https://github.com/RickCarlino/KoalaSRS">https://github.com/RickCarlino/KoalaSRS</a>

based_gigachad2over 1 year ago

> I worked on a chrome extension a few weeks ago that skips sponsorship sections in YouTube videos by reading through the transcriptYou might want to connect that to SponsorBlock<a href="https://sponsor.ajay.app/" rel="nofollow">https://sponsor.ajay.app/</a>

sethkimover 1 year ago

I built <a href="https://tailgate.dev/" rel="nofollow">https://tailgate.dev/</a> a few months ago. It can help with deployment of simple, client-facing generative web apps. There are a few simple demos on the home page!

qeternityover 1 year ago

Perhaps a follow on question, as I presume a lot of people reading the comments are looking for inspiration to build things (and those building might not want to reveal yet) what would you like to see built with the capabilities provided by LLMs?

评论 #39267429 未加载

patelajay285over 1 year ago

We've built a prompting, synthetic data generation, and training library called DataDreamer: <a href="https://github.com/datadreamer-dev/DataDreamer">https://github.com/datadreamer-dev/DataDreamer</a>

rwxdover 1 year ago

I built a blog with stupid & overengineered technical solutions. Also has an audio interview for every blog post.<a href="https://shitops.de/posts/" rel="nofollow">https://shitops.de/posts/</a>

xanderbarkhatovover 1 year ago

Found myself needing to find emojis relevant to a specific theme and wanted to play with OpenAI's API. So, I built <a href="https://emojisearch.fun" rel="nofollow">https://emojisearch.fun</a>

maytcover 1 year ago

Built an LLM interface to control my browser. Used it to generate playwright tests for me <a href="https://github.com/mayt/BrowserGPT">https://github.com/mayt/BrowserGPT</a>

jasfiover 1 year ago

I'm building AI Construx (<a href="https://aiconstrux.com" rel="nofollow">https://aiconstrux.com</a>): build things with AI. I'm planning to launch the private beta by end of Feb.

sanketskasarover 1 year ago

Can the smart folks on HN point out to a good resource or a collection of resource for a software engineer to get up to speed with LLMs and Gen AI concepts, and understand basic deployments and use cases?

评论 #39276150 未加载

arbugeover 1 year ago

I built <a href="https://QexAI.com" rel="nofollow">https://QexAI.com</a>.I also use LLMs in some other web apps, but mainly as incidental writing aids, rather than the central feature of the app.

Libcat99over 1 year ago

I built a completely useless ai phone bot. You call it and ask it a question, and it responds with an answer that always involves sandwiches.It adds no value beyond entertainment, but I suppose it does do that.

评论 #39301789 未加载

waqas_xover 1 year ago

Upload product photos, get detailed, seo optimized, product descriptions. <a href="https://producks.ai/" rel="nofollow">https://producks.ai/</a>

jerpintover 1 year ago

An open source retrieval augmented generation (RAG) framework:<a href="https://www.github.com/jerpint/buster">https://www.github.com/jerpint/buster</a>

gxaraover 1 year ago

I've built a tool to help students in the note-taking process.It is <a href="https://cmaps.io" rel="nofollow">https://cmaps.io</a>

justanothersysover 1 year ago

i built autosuggestions / catch all prompt responses on <a href="https://aesthetic.computer" rel="nofollow">https://aesthetic.computer</a> and you can also talk to characters like boyfriend, girlfriend, husband and wife. characters are great for kids and older users who really wouldn't experience the tech otherwise.

sayyidkaover 1 year ago

I'm building a chatbot (API + frontend) to transcribe natural language questions into SQL query for a Snowflake database.

itaydrover 1 year ago

An app for making children’s stories.<a href="https://schrodi.co/" rel="nofollow">https://schrodi.co/</a>

Kappa90over 1 year ago

An app that aggregates the news from websites, blogs, YouTube channels and podcasts, and generate easily digestible summaries, along with a small podcast version so you can stay informed in an easy stress-free way.Right now I’m working on including automatic fact checking and insights on how each source might be opinionated vs. reporting just the facts.<a href="https://usetailor.com" rel="nofollow">https://usetailor.com</a>

coolvisionover 1 year ago

bookmarking extension, not much traction though <a href="https://chromewebstore.google.com/detail/autolicious/jbmpomloomhbfflncpmcmfajbppfddjk?pli=1" rel="nofollow">https://chromewebstore.google.com/detail/autolicious/jbmpoml...</a>

keiferkifover 1 year ago

I'm making a Magic Card generator

评论 #39266239 未加载

iseouluover 1 year ago

I built a simple RAG chatbot, and my "stack" is plain openai python client at this point.

bengarneyover 1 year ago

I wrote an autonomous AI space opera tv show generator. It takes a short topic phrase on one end and spits out a 10-15 minute 3D animated and AI voiced video suitable for upload to YouTube on the other end.Super interesting learning exercise since it intersects with many enterprise topics, but the output is of course more fun.In some ways it is more challenging - a summary is still useful if it misses a point or is a little scrambled, whereas when a story drops a thread it’s much more immediately problematic.I’m working on a blog post as well as getting a dozen episodes uploaded for “season 1”.

ropableover 1 year ago

Semi-automated transcriptions for my favourite podcast, via OpenAI Whisper.

tomcamover 1 year ago

The range of creativity and ingenuity in these answers is mind-boggling!

resbalosoover 1 year ago

A little AI domain name generator: <a href="https://namebrewery.com/" rel="nofollow">https://namebrewery.com/</a>Used SvelteKit and Supabase. Deployed to Cloudflare Pages.

aakilover 1 year ago

I built a flask app based chrome extension that takes content from the DOM and sends it to chatGPT for summarization, I also configured it to work on YouTube videos and PDFs, helps when you want to share the tl;dr of a site or video to a friend, I'm thinking I'm going to add some more specific summary functionality next, like listing out a recipe's ingredients and cooking steps<a href="https://chromewebstore.google.com/detail/news-article-summarizer/pohjdmofkgcgbihdhiegnhghklhieadc" rel="nofollow">https://chromewebstore.google.com/detail/news-article-summar...</a>

brian_hermanover 1 year ago

My brother built a security scanner with an LLM

xtrohnxover 1 year ago

Just a personal project - I got a deep interest in the CIA's Stargate program and the declassified documents in the "reading room." I wrote a script to scrape all of the readable or OCRd text from the documents, and fed them into GPT-3.5 to get a summary. It definitely makes reading through the documents easier.I have all of the docs with summaries on a small webserver here: <a href="https://ayylmao.info" rel="nofollow">https://ayylmao.info</a>Simple Flask site with SQLite as the database.

评论 #39273060 未加载

jiggawattsover 1 year ago

It’s just a scratch that needed itching, but I wrote a command-line utility for translating “SRT” format subtitles into other languages.I hit some interesting challenges, overcoming which was a valuable set of lessons learnt:1. GPT4 Turbo slowed down to molasses in some Azure regions recently. Microsoft is not admitting this and is telling people to use GPT3.5 instead. The lesson learned is that using a regional API exposes you to slowdowns and queuing caused by local spikes in demand, such as “back to school” or end of year exams.2. JSON mode won’t robustly stick to higher level schemas. It’s close enough, but parsing and retries are required.3. The 128K context in GPT4 is only for the input tokens! The output is limited to 4K.4. Most Asian languages use as many as one token per character. Translating 1 KB of English can blow through the 4 KB token limit all too easily.5. You can ask GPT to “continue”, but then you have to detect if you received a partial or a complete JSON response, and stitch things together yourself… and validate across message boundaries.6. The whole process above is so slow that it hits timeouts all over the place. Microsoft didn’t bother to adjust any of their default Azure SDK timeouts for HTTP calls. You have to do this yourself. It’s easy, just figure which of the three different documented methods are still valid. (Answer: none are.)7. You’ll need a persistent cache. Just trust me on this. I simply hashed the input and used that as a file name to store responses that passed the checks.8. A subtitle file is about 30–100 KB so it needs many small blocks. This makes the AI lose the context. So it’s important to have several passes so it can double check and stitch things together. This is very hard with automatic parsing of outputs.9. Last but not least: the default mode of Azure is to turn the content policy up to “puritan priest censoring books”. Movies contain swearing, violence, and sex. The delicate mind of the machine can’t handle this, and it will refuse to do as it is asked. You have to dial it down to get it to do anything. There is no “zero censorship” setting. Microsoft says that I can’t feed text to an API that I can watch on Netflix with graphic visuals.10. The missus says that the AI-translated subtitles are “perfect”, which is a big step up from some fan translated subtitles that have many small errors. Success!I wrote this as a C# PowerShell module because that makes it easy to integrate the utility as a part of a pipeline. E.g.: I can feed it a directory listing and it’ll translate all of the subtitles.The performance issues meant I had to process 8x chunks in parallel. Conveniently I already had code lying around to do this in PowerShell with callbacks to the main thread to report progress, etc…

dorkwoodover 1 year ago

I generate and sell books that summarize historical events. I was actually ready to launch last month until I realized I could generate extremely realistic photographs in Midjourney and splice them between paragraphs using a simple python script, so I went back and did another pass.My process involves generating chapters as markdown, using a script to join chapters together, and then finally converting the markdown to ebooks using Gitbook.

评论 #39287726 未加载

评论 #39286024 未加载

JSDevOpsover 1 year ago

I've used it to write lots of Bash! Boilerplate stuff it's great for that.<a href="https://github.com/tg12/gpt_jailbreak_status">https://github.com/tg12/gpt_jailbreak_status</a><a href="https://github.com/tg12/data-analytics">https://github.com/tg12/data-analytics</a><a href="https://github.com/tg12/script-toolbox">https://github.com/tg12/script-toolbox</a>

ailiciousover 1 year ago

I built a tool to repeat a chat discussion against a set of data.Let say, you have a row with 4 fields, you chat with your row, then you apply same conversation to all other rows!<a href="https://www.youtube.com/watch?v=e550X6R89W4" rel="nofollow">https://www.youtube.com/watch?v=e550X6R89W4</a> <a href="https://bulkninja.com/" rel="nofollow">https://bulkninja.com/</a>