I've been feeling FOMO (for lack of a better term) about recent AI & ML/GPT progression.<p>It feels like ML/AI might be the beginning of the end for a large class of things (if I wanted to be alarmist I'd say "everything") -- and the fact that Fabrice Bellard has jumped in and done the absolutely obvious rising-tide thing (building an API that abstracts the technologies) speaks volumes.<p>Releasing something like this fits Fabrice's pattern of work -- he built QEMU, and that served as a similar enabling fabric for people to run virtual machines. QuickJS quietly powers some JS-on-another-platform functionality.<p>Simon was right. The Stable Diffusion moment[0] is already here. It's going to accelerate. It was already moving at a speed that was hard to follow, and it's about to get even faster.<p>There are too many world-changing things moving forward at the same time, and I'm only looking at such a small cut of the tech sphere. I don't know what to do with myself, I feel so thoroughly unprepared.<p>[0]: <a href="https://simonwillison.net/2023/Mar/11/llama" rel="nofollow">https://simonwillison.net/2023/Mar/11/llama</a>
> All is included in a single binary. Very few external dependencies (Python is not needed) so installation is easy on most Linux distributions.<p>I have to disagree. The combination of being closed-source and dynamically linked makes a program a hassle to run on Linux. Even if it isn't at the moment of release, it soon becomes one. While <i>ts_server</i> is better than most, it already requires an old version of <i>libjpeg-turbo</i> not available in my distribution's repositories. I had to run it in a Rocky Linux container:<p><pre><code> docker run \
--rm \
--mount type=bind,source="$(pwd)",target=/app/ \
--publish 127.0.0.1:8080:8080 \
rockylinux:9 \
sh -c 'dnf install -y libjpeg libmicrohttpd && cd /app/ && ./ts_server ts_server.cfg'
</code></pre>
The solutions to this problem that I am aware of that do not involve releasing the source code are: 1) static linking; 2) containers; 3) shipping a Windows binary :-) ("Win32 is the only stable ABI on Linux" -- <a href="https://blog.hiler.eu/win32-the-only-stable-abi/" rel="nofollow">https://blog.hiler.eu/win32-the-only-stable-abi/</a>).
Fabrice Bellard's web page must be one of the most underselling ones on the entire web. So many amazing projects. Not a word that really emphasizes their importance or coolness. Just a simple list with short factual descriptions.
Very interesting as usual from Fabrice Bellard, but I'm a little bit disappointed this time, because libnc is a closed-source DLL. Nevertheless it will be interesting to compare it to the amazing work of Georgi Gerganov: the GGML tensor library. Both are heavily optimized, support AVX intrinsics, and are plain C/C++ implementations without dependencies.
Anyone know what format the models have to be in for use with textsynth? I looked at the gpt2 example binary (gpt2_117M.bin) and it seems like the "normal" params.json is embedded as a header for the binary, followed by ASCII strings like "attn/c_attn/" and then the binary weights.<p>I tried just using the Stanford Alpaca fine-tuned version of the llama 7B weights that work with llama.cpp, but textsynth didn't like that (ggml-alpaca-7b-q4.bin: invalid file header). Having a textsynth HTTP API would save me a lot of hassle. I'm currently wrapping the stdin/stdout of an execution of a modified llama.cpp binary, and that's extremely messy.
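If the layout really is a JSON blob followed by named tensors (I'm only guessing from a hexdump, the format is undocumented), you can at least peek at the embedded params by scanning for the end of the leading JSON object. A minimal sketch; `read_json_header` is my own helper name, not part of any textsynth tooling:

```python
import json


def read_json_header(data: bytes) -> tuple[dict, bytes]:
    """Split a blob that appears to start with a JSON object (the
    embedded params.json) from the tensor data that follows it.

    Naive brace counting -- it would break if the JSON contained
    braces inside string values, but that's fine for a quick peek.
    """
    depth = 0
    for i, b in enumerate(data):
        if b == ord("{"):
            depth += 1
        elif b == ord("}"):
            depth -= 1
            if depth == 0:
                header = json.loads(data[: i + 1])
                return header, data[i + 1:]
    raise ValueError("no complete JSON object at start of file")


# Example on synthetic bytes mimicking the observed layout:
blob = b'{"n_layer": 12, "n_head": 12}attn/c_attn/\x00\x01'
params, rest = read_json_header(blob)
```

To try it on the real file you'd do `read_json_header(open("gpt2_117M.bin", "rb").read())` and inspect the returned dict, though there may well be length fields or magic bytes this sketch ignores.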
In comments on this post, and elsewhere on other posts about AI, I see a lot of people referring to worries around the potential for many types of jobs to be heavily impacted by this technology.<p>I feel like people are often referring to 'coding' when they express these worries. You know, actually writing code, having been given a spec to do so, and perhaps also participating in code review, writing tests, all the usual engineer stuff.<p>My question is, amongst the HN crowd, what kinds of roles or areas do we think might be somewhat immune to this effect? The first things that occur to me are security, infrastructure & ops, and networking. And of course the requirements-gathering stage of software development. It is already the case that a lot of senior devs probably don't write much code and spend more time on communication between different stakeholders and overseeing whoever (or whatever) is writing the code.<p>Anyone else been thinking about this? What tech roles might thrive in the face of AI?
For me, the most interesting part is the statistics on all the models. These show that 8-bit quantization is basically as good as the full model and 4-bit is very close. This is the first time I have seen such a table across a large number of models in one place.
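The intuition behind those numbers is easy to reproduce. ts_server's actual quantization scheme isn't documented, but a generic symmetric linear quantizer (my own sketch, not Bellard's code) already shows why 8 bits loses almost nothing while 4 bits starts to bite:

```python
import random


def quantize_roundtrip(weights, bits):
    """Symmetric linear quantization: map each float to one of
    2^(bits-1)-1 signed integer levels, then back to a float."""
    levels = 2 ** (bits - 1) - 1          # 127 for 8-bit, 7 for 4-bit
    scale = max(abs(w) for w in weights) / levels
    return [round(w / scale) * scale for w in weights]


def max_error(weights, bits):
    """Largest absolute round-trip error over the weight vector."""
    deq = quantize_roundtrip(weights, bits)
    return max(abs(a - b) for a, b in zip(weights, deq))


random.seed(0)
w = [random.gauss(0, 0.02) for _ in range(1000)]  # typical NN weight scale
err8 = max_error(w, 8)   # tiny: bounded by scale/2 with 127 levels
err4 = max_error(w, 4)   # roughly 18x coarser with only 7 levels
```

Real schemes (GGML's q4, for instance) quantize in small blocks with a per-block scale, which keeps the 4-bit error much lower than this whole-tensor version suggests.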
"<i>The CPU version is released as binary code under the MIT license</i>"<p>This gives off the surreal sci-fi vibe that the binary <i>is</i> the source. And who knows... true wizards work in mysterious ways.