100M Token Context Windows

94 points by gklitt 9 months ago

8 comments

shazami 9 months ago
FYI: wouldn't interview here. Got rejected after a 30-minute behavioral screen, after spending 8 hours on an unpaid take-home.
dinobones 9 months ago
Long context windows are, IMO, "AGI enough."

A 100M-token context window means it can probably store everything you've ever told it for years.

Couple this with multimodal capabilities, like a robot encoding vision and audio into tokens, and you get autonomous assistants that learn your house/habits/chores really quickly.
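(A rough back-of-envelope check of that "years" claim. The words-per-token ratio, speaking rate, and hours-per-day figures below are assumptions for illustration, not from the comment:)

```python
# Rough arithmetic behind the "store years of conversation" claim.
# Assumptions (not from the thread): ~0.75 words per token,
# ~130 spoken words per minute, ~2 hours of user speech per day.
CONTEXT_TOKENS = 100_000_000
WORDS_PER_TOKEN = 0.75
SPOKEN_WPM = 130
SPEECH_MINUTES_PER_DAY = 120

words_capacity = CONTEXT_TOKENS * WORDS_PER_TOKEN            # ~75M words
minutes_capacity = words_capacity / SPOKEN_WPM               # ~577k minutes
days_capacity = minutes_capacity / SPEECH_MINUTES_PER_DAY    # ~4,800 days
print(f"{words_capacity:,.0f} words ≈ {days_capacity:,.0f} days "
      f"≈ {days_capacity / 365:.1f} years of conversation")
```

Under those assumptions, 100M tokens holds roughly 13 years of daily conversation, which is the order of magnitude the comment is gesturing at.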
smusamashah 9 months ago
It should be benchmarked against something like RULER [1].

[1]: https://github.com/hsiehjackson/RULER (RULER: What's the Real Context Size of Your Long-Context Language Models)
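(For readers unfamiliar with this class of benchmark: the core idea behind needle-in-a-haystack-style tests like RULER's retrieval tasks is hiding a fact at a controlled depth in long filler text and checking recall. A minimal sketch of that idea, assuming a `query_model` function for whatever API is under test; this is illustrative, not RULER's actual harness:)

```python
def make_needle_test(context_len_words: int, depth_fraction: float):
    """Build a haystack of filler text with one fact hidden at a given depth."""
    needle = "The secret access code is 7-4-2-9. "
    filler = "The sky was clear and the market was quiet that morning. "
    haystack, words = [], 0
    while words < context_len_words:
        haystack.append(filler)
        words += len(filler.split())
    # depth_fraction 0.0 buries the needle at the start, 1.0 at the end.
    haystack.insert(int(len(haystack) * depth_fraction), needle)
    prompt = "".join(haystack) + "\nQuestion: What is the secret access code?"
    return prompt, "7-4-2-9"

# Sweep context depths; `query_model` is a placeholder for the model API.
for depth in (0.0, 0.25, 0.5, 0.75, 1.0):
    prompt, answer = make_needle_test(context_len_words=100_000, depth_fraction=depth)
    # correct = answer in query_model(prompt)
```

RULER goes beyond this with multi-needle, aggregation, and tracing tasks, which is why it is a stricter measure of "real" context size than a single retrieval probe.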
fsndz 9 months ago
Context windows are becoming larger and larger, and I anticipate more research focusing on this trend. Could this signal the eventual demise of RAG? Only time will tell. I recently experimented with RAG, and the limitations are often surprising (https://www.lycee.ai/blog/rag-fastapi-postgresql-pgvector). I wonder if we will see some of the same limitations with long-context LLMs. In-context learning is probably a form of arithmetic over semantic/lexical cues.
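(The pgvector setup described in the linked post reduces to embedding the query and doing a nearest-neighbour SQL lookup. A minimal sketch, assuming a `documents(content, embedding vector)` table and an `embed()` helper already exist; the table name, column names, and embedding function are illustrative assumptions, not taken from the post:)

```python
import psycopg2  # requires the pgvector extension enabled in PostgreSQL

def retrieve(query: str, conn, embed, k: int = 5):
    """Return the k documents nearest to the query embedding (cosine distance)."""
    query_vec = embed(query)  # e.g. a sentence-transformers or API embedding
    with conn.cursor() as cur:
        cur.execute(
            "SELECT content FROM documents "
            "ORDER BY embedding <=> %s::vector "  # <=> is pgvector cosine distance
            "LIMIT %s",
            (str(query_vec), k),
        )
        return [row[0] for row in cur.fetchall()]

# The retrieved chunks then get stuffed into the LLM prompt -- exactly the
# step a 100M-token context window would let you skip for many corpora.
```

The comparison in the comment is then concrete: RAG's failure modes live in the retrieval step (chunking, embedding quality, top-k cutoffs), while a long-context model trades those for whatever failure modes in-context recall has at scale.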
Sakos 9 months ago
I was wondering how they could afford 8,000 H100s, but I guess I accidentally skipped over this part:

> We've raised a total of $465M, including a recent investment of $320 million from new investors Eric Schmidt, Jane Street, Sequoia, Atlassian, among others, and existing investors Nat Friedman & Daniel Gross, Elad Gil, and CapitalG.

Yeah, I guess that'd do it. Who are these people, and how'd they convince them to invest that much?
anonzzzies 9 months ago
What is the state of the art for context length on open models? Magic won't be open, I guess, after getting $500M in VC money.
samber 9 months ago
Based on Mamba?
htrp 9 months ago
Does anyone have a detailed tech breakdown of these guys? Not quite sure how their LTM architecture works.