TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Open-Source GitHub Copilot?

28 pointsby ghoomketuover 2 years ago
Just wondering if for some reason copilot shuts down, I was wondering if it&#x27;s possible to home brew it.<p>Some hurdles I see:<p>- Github rate limits the GET requests so it doesn&#x27;t seem possible to scrape all the source code on there. But maybe it can be crowdsourced like seti@home so 1000 people can install a program to get around this.<p>- Training the model. I would imagine this would be hardest as it would need millions of dollars for this? Is there a way to get around it or using free tools like colab?<p>- Running the api. Once the model is trained, would it be possible to run it on a lenovo type laptop? I guess you need lots of VRAM to run it?<p>Final question is will a home brewed version be just as good? What factors determine that?<p>Just curious on how we can do it as I imagine there a lot of ML experts here.

7 comments

ericpauleyover 2 years ago
Short answer, yes: <a href="https:&#x2F;&#x2F;github.com&#x2F;moyix&#x2F;fauxpilot" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;moyix&#x2F;fauxpilot</a>
armchairhackerover 2 years ago
There is a 3TB model “The Stack” which I believe is partly designed for this: all of the code is properly licensed.<p>Training the model would be expensive but it’s a one-and-done process. With the model openly available cloud providers could provide a subscription service to end-users which recoups the cost of running it.<p>The only issue is I imagine GitHub has <i>much</i> more code than 3TB.
jacooperover 2 years ago
BTW, this wouldn&#x27;t solve the legal hurdles of Copilot. The model needs to mention which license the code has, which AFAIK Amazon&#x27;s competitor to Copilot already does that.
评论 #33491881 未加载
评论 #33492195 未加载
btwillardover 2 years ago
Not a complete replacement, but very cool and related: <a href="https:&#x2F;&#x2F;github.com&#x2F;webyrd&#x2F;Barliman" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;webyrd&#x2F;Barliman</a>
wokwokwokover 2 years ago
You’ll need &gt; 20 GB of GPU memory run the model.<p>This is the same reason people can’t easily “play with” GPT like models.<p>&gt; would it be possible to run it on a lenovo type laptop?<p>No.<p>You might, with a hybrid Mac book pro M1 or M2 with 64GB of combined memory; pretty much any other lapto, categorically no.<p>You’d have to rent &#x2F; own a separate server with epic GPU power.<p>&gt; Final question is will a home brewed version be just as good?<p>No.<p>The open source language models are not as good as GPT3.
forumrangerover 2 years ago
This is not really asked at you, but the whole idea of copilot. How about people just write their own code??
alrlroipspover 2 years ago
I don&#x27;t think most open source dev:s want CoPilot or a FOSS alternative for this very reason:<p>Code assist AI does no attribution.<p>This removes engagement between the dev and library authors. this ruins chances of engaging new contributors over time, eroding and killing the FOSS communities.<p>Code assist AI also does not care about licenses. See [1]<p>1: <a href="https:&#x2F;&#x2F;www.bleepingcomputer.com&#x2F;news&#x2F;security&#x2F;microsoft-sued-for-open-source-piracy-through-github-copilot&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.bleepingcomputer.com&#x2F;news&#x2F;security&#x2F;microsoft-sue...</a>
评论 #33492201 未加载