anyone else concerned that training models on synthetic, LLM-generated data might push us into a linguistic feedback loop?
relying on LLM text for training could bias the next model toward even heavier overuse of words like "delve", "showcasing", and "underscores", and since that model's output feeds the next round of training data, the skew compounds with every generation...
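
here's a quick toy simulation of the loop. to be clear, everything in it is made up for illustration: the 1.2x head start for the marker words, the vocab size, the temperature, all assumptions, not measurements from any real model. each "generation" just refits word frequencies on text sampled from the previous one, with mild low-temperature sharpening standing in for the way decoding over-samples already-likely tokens:

```python
import random
from collections import Counter

# toy feedback-loop sketch -- every name and number here is invented
# for illustration, not measured from any real model or corpus
MARKERS = {"delve", "showcasing", "underscores"}
VOCAB = sorted(MARKERS) + [f"word{i}" for i in range(97)]

def sample_corpus(weights, n_tokens=50_000, temperature=0.8):
    # temperature < 1 sharpens the distribution, loosely mimicking how
    # decoding over-samples tokens that are already likely
    sharpened = [w ** (1 / temperature) for w in weights]
    return random.choices(VOCAB, weights=sharpened, k=n_tokens)

def refit(corpus):
    # "train" the next generation: its word distribution is just the
    # empirical frequencies of the synthetic corpus it was fed
    counts = Counter(corpus)
    return [counts.get(w, 1) for w in VOCAB]  # floor at 1 so nothing hits zero

# generation 0: marker words start only slightly over-represented,
# the way "delve" already is in LLM output relative to human text
weights = [1.2 if w in MARKERS else 1.0 for w in VOCAB]
for gen in range(8):
    corpus = sample_corpus(weights)
    share = sum(1 for tok in corpus if tok in MARKERS) / len(corpus)
    print(f"gen {gen}: marker words = {share:.2%} of tokens")
    weights = refit(corpus)
```

with these made-up settings the marker share should roughly double over eight generations even though nothing ever injects new bias after round zero, which is the whole worry: a small initial skew plus train-on-your-own-output is enough to drift.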