2 pointsby Anil1331over 1 year ago

NExT-GPT is the first end-to-end MM-LLM that perceives input and generates output in arbitrary combinations (any-to-any) of text, image, video, and audio and beyond.

1 comment

jdwgover 1 year ago

I gave it a photo of my face, and it said that I look approachable and friendly. I asked it to improve my hair, and it replaced me with a completely different person. Oh well! Still very impressive tech.

评论 #37598012 未加载

Show HN: NExT-GPT – First LLM working with multimodal input and output

1 comment

Show HN: NExT-GPT – First LLM working with multimodal input and output

1 comment