TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

GitLab is working on a tool just for data teams

233 pointsby TheMissingPiecealmost 7 years ago

15 comments

slap_shotalmost 7 years ago
This looks like an amalgamation of 8+ open source projects or industries with products put forth by companies that have dozens of employees and worked on their products for years.<p>It also doesn&#x27;t even categorize the products they compete with correctly[0].<p>Why not contribute some of your resources to one of the many active open source libraries already trying to solve some of these problems, and focus your engineering efforts on your core product?<p>[0] Fivetran is only considered &quot;Orchestrate&quot; but is actually competes directly with Alooma in the Extract and Load. Also, there are DOZENS of company in that space. <a href="https:&#x2F;&#x2F;gitlab.com&#x2F;meltano&#x2F;meltano&#x2F;blob&#x2F;master&#x2F;README.md#data-science-lifecycle" rel="nofollow">https:&#x2F;&#x2F;gitlab.com&#x2F;meltano&#x2F;meltano&#x2F;blob&#x2F;master&#x2F;README.md#dat...</a>
评论 #17668152 未加载
评论 #17668578 未加载
评论 #17668888 未加载
cheghookalmost 7 years ago
I can&#x27;t understand why GitLab thinks they have to embark on a new project every so often instead of focusing on their current product and features. There is just a lot to work on, so many of the current features&#x2F;products are half assed. At my place we moved to GitLab 2.5 years ago and updates where smoother back then but the past few months we had to hire a new sys admin for our build machines and GitLab server to follow on new issues created on GitLab.com and decide if it&#x27;s safe release and even then he still reports 4-5 issues to GitLab support after every update. We were expecting it to be an easy `yum update` like a normal package but it&#x27;s just getting worse update after update. It&#x27;s so bad that my manager asked me to look into GitHub + another CI&#x2F;CD solution.
评论 #17669312 未加载
评论 #17668343 未加载
评论 #17670656 未加载
georgewfraseralmost 7 years ago
Data pipelines are not a great subject for an open-source project. We&#x27;ve been building these for the last 3+ years at Fivetran, and I can tell you that the challenge is:<p><pre><code> - Studying each source to figure out the right data model - Chasing down a million weird corner cases - Working around dumb bugs in the data sources </code></pre> This is the kind of problem where paying for software really works better. When people build data pipelines in-house, they tend to hack at it until it works for their use case and then stop. When we build data pipelines, we map out every feature of the data source, implement the whole thing at once, and then put it through a beta period with <i>multiple</i> real users. This is easy to do when you have a tight-knit dev team; much harder for a group of part-time open-source contributors.
评论 #17670236 未加载
评论 #17669875 未加载
评论 #17669750 未加载
评论 #17670598 未加载
tbrockalmost 7 years ago
I wish they would focus on making a fast, stable, GitHub alternative.
评论 #17670855 未加载
n42almost 7 years ago
Is there any example of an open source software company that has taken on so many products at once, so early in its life, and succeeded?
评论 #17668828 未加载
veritas3241almost 7 years ago
Taylor from GitLab here! Happy to answer any questions about what we&#x27;re doing.
评论 #17668506 未加载
_pmf_almost 7 years ago
GitLab&#x27;s usage of team members in marketing material is creeping me out (as does the whole team page[0]).<p>[0] <a href="https:&#x2F;&#x2F;about.gitlab.com&#x2F;team&#x2F;" rel="nofollow">https:&#x2F;&#x2F;about.gitlab.com&#x2F;team&#x2F;</a>
评论 #17671406 未加载
ageofwantalmost 7 years ago
<a href="https:&#x2F;&#x2F;quiltdata.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;quiltdata.com&#x2F;</a> ticks a lot of boxes in this space for me.
评论 #17670872 未加载
评论 #17668068 未加载
danpalmeralmost 7 years ago
Reading this I was concerned that it would be written in Ruby. While Ruby is a reasonable language for server development, it has almost no data science community when compared with some other ecosystems.<p>I was very glad to see this is Python! Python has some of the best data tools out there, and a mature ecosystem for solving all the engineering problems that go along with a great data stack.
评论 #17673997 未加载
tamersalamaalmost 7 years ago
Is there some resemblance with Floydhub <a href="http:&#x2F;&#x2F;floydhub.com&#x2F;" rel="nofollow">http:&#x2F;&#x2F;floydhub.com&#x2F;</a> ?
评论 #17670846 未加载
评论 #17671045 未加载
Luuseensalmost 7 years ago
The page talks mentions MVC, and the issue page[0] keeps mentioning MVC as well. Was this supposed to be MVP, or something else? Model-view-controller doesn&#x27;t make sense in the context.<p>[0] <a href="https:&#x2F;&#x2F;gitlab.com&#x2F;meltano&#x2F;meltano&#x2F;issues&#x2F;10" rel="nofollow">https:&#x2F;&#x2F;gitlab.com&#x2F;meltano&#x2F;meltano&#x2F;issues&#x2F;10</a>
评论 #17670661 未加载
ajboscoalmost 7 years ago
Do you see this as a (future) competitor of Airflow&#x2F;Luigi type workflow tools?
评论 #17671727 未加载
hn_throwaway_99almost 7 years ago
Be interested to know all the competitors in this space. <a href="https:&#x2F;&#x2F;data.world&#x2F;" rel="nofollow">https:&#x2F;&#x2F;data.world&#x2F;</a> is one I am most familiar with.
评论 #17668217 未加载
评论 #17668023 未加载
评论 #17668066 未加载
gandutraveleralmost 7 years ago
Looks like gitlab just wants to be in news since Microsoft&#x27;s aquisition of GitHub.
sbr464almost 7 years ago
Are you releasing&#x2F;sharing any of the extractors you built for various services?
评论 #17670668 未加载