Launch HN: Jitsu (YC S20) – Open-Source Segment Alternative

265 points by vklmn over 3 years ago
Hey HN! Vlad here with Sergey, Ildar, and Kirill. We are building Jitsu, an open-source Segment alternative (https://github.com/jitsucom/jitsu, https://jitsu.com/). We help companies collect events from their apps, websites, and APIs and send them to databases.

I've been doing data engineering for more than ten years (for half of that time, I didn't know it was called "data engineering"). Before Jitsu, I was a co-founder and CTO of GetIntent, an ad-tech startup. Although it was ad-tech (I'm sorry for that!), we also built quite a fascinating technology platform. We processed up to 1 million events per second at peak, and all those events needed to be stored somewhere.

We churned through a few data warehouse platforms along the way. In 2013, we started with Hadoop's HDFS and a bunch of map-reduce jobs on top of it. Then, when we decided to allow our customers to run ad-hoc reports, we switched to BigQuery. BigQuery was great but expensive, especially with some customers obsessively clicking the refresh button. Finally, in 2017 we migrated to self-hosted ClickHouse, which in my opinion is still the best analytics database in the world.

All that time, we spent a fair amount of effort getting data into the database. When you're dealing with millions of events per minute, running an INSERT statement per event won't work. What if the DB is down for maintenance? How can you be sure that all 50+ edge nodes are aware of recent DB schema changes? Also, did you know streaming data to BigQuery is costly while batching data is free?

We tried different approaches: first, we would write local log files, sync them to HDFS, and load data to BQ (or ClickHouse) with map-reduce jobs. To improve data freshness, we ditched HDFS and started to send data in batches to the DB directly from edge servers. We experimented with Kafka, but it felt too complex for that task at the time.

I always dreamed about a straightforward service to which I'd throw JSON objects, and it would take care of the rest: queueing, retrying, updating the database schema, etc.

Then I discovered Segment. I liked it at first. It seemed very developer-friendly, with a nice API and excellent documentation. But the pricing model and data delays (an event gets to the DB 12 hours after it has been sent to Segment) killed the whole idea. And it was not open source. In my opinion, being open-source and self-hostable is a must for such a fundamental part of the architecture as data collection.

I left GetIntent and got accepted to YC with a different idea for the Summer 2020 batch. The idea was to build a churn prevention and BI tool for online retailers. It didn't take off, but in the process we built a component to collect customers' app events and put them into a DB. We tried to hack a solution on top of the ELK stack, but I was frustrated with Elasticsearch's lack of SQL support. Here I was, back to square one: there was no good open-source event collection service yet, and we needed to build one, once again.

So we decided to focus solely on that problem. We ditched all the previous code, which was in Java, rewrote the data collection server in Go, and hacked together what we called EventNative [1]. It was received very well, and we started to get users.

Over the last 11 months, we've been busy building the UI, adding Connectors (to pull data from external APIs), polishing data warehouse support, adding JavaScript support to transform incoming data, and implementing dozens of other features.

Now we're launching Jitsu, an open-source Segment alternative. With Jitsu, we make it easy to collect data and send it to databases (we support all major players: ClickHouse, Redshift, Snowflake, BigQuery, and Postgres). We're deployed in production at a large gaming publisher, an eSignature service, and many other great companies. We're going for an open-core model. So far we don't have paid features, but soon we'll have some, presumably around things like authorization and data masking. We also run Jitsu.Cloud [2], which you can buy if you don't want to self-host.

Give it a spin: https://github.com/jitsucom/jitsu.

Thank you for reading this story. I hope it was interesting. I would love to read your feedback on Jitsu and answer questions!

[1] https://news.ycombinator.com/item?id=24120325
[2] https://cloud.jitsu.com
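The post's point about batching events instead of running one INSERT per event, and about surviving a database that is briefly down, can be illustrated with a minimal sketch. This is not Jitsu's implementation: `BatchingBuffer`, its parameters, and the `sink` callable are hypothetical names made up for illustration, and the sink stands in for whatever bulk-load call a real warehouse client would provide.

```python
import time
from typing import Callable, Dict, List

Event = Dict[str, object]


class BatchingBuffer:
    """Hypothetical helper: buffers events and writes them in batches with retries."""

    def __init__(
        self,
        sink: Callable[[List[Event]], None],  # assumed bulk-write function
        batch_size: int = 1000,
        max_retries: int = 3,
        backoff_seconds: float = 0.0,
    ) -> None:
        self.sink = sink
        self.batch_size = batch_size
        self.max_retries = max_retries
        self.backoff_seconds = backoff_seconds
        self.pending: List[Event] = []

    def add(self, event: Event) -> None:
        """Queue one event; flush once a full batch has accumulated."""
        self.pending.append(event)
        if len(self.pending) >= self.batch_size:
            self.flush()

    def flush(self) -> bool:
        """Try to write all pending events in one call; keep them on failure."""
        if not self.pending:
            return True
        batch = list(self.pending)
        for attempt in range(self.max_retries):
            try:
                self.sink(batch)
            except ConnectionError:
                # Database unreachable: wait with exponential backoff, then retry.
                time.sleep(self.backoff_seconds * (2 ** attempt))
                continue
            # Delivered: drop exactly the events that were written.
            del self.pending[: len(batch)]
            return True
        # All attempts failed: events stay queued for a later flush.
        return False
```

In this sketch a failed flush leaves the events in memory, so the next `add` triggers another attempt. A production collector would more likely persist the queue to disk so that a process restart does not lose data.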

37 comments

kposehn over 3 years ago
Great product. I'm a frequent user of Segment from the early days and have been curious to see when an open-source competitor comes around that will match feature-for-feature.

Thoughts:

1. You've got most major ads sources that I care about, but it *seems* that there is a higher bar to implementation. Segment lets me just plug in Google & FB ads and dump the entire shebang right into my data warehouse. A lot of marketing teams are going to have less time/resources to deal with implementation so smoothing this out is key.

2. Functions are an underrated and highly powerful feature of Segment. The ability to operate on data in transit, create custom connectors that "just run" (akin to CF Workers) and the like is a big selling point for more technically advanced marketing teams. It doesn't seem present here and that would hold a customer such as myself back on bigger scope projects.

3. I'd love to see a "compare us to *your* segment usage" where I select my data sources and destinations to see what you cover vs. Segment in a specific use case (and possibly pricing advantages on a self-hosted vs. non). This would make it much easier to sell through procurement and devops for new customers that are switching.

4. There are going to be a lot of people like me that are soon to start fresh in terms of marketing stack, so going after people before they select Segment might also be a play.

Looking forward to seeing where you all take this. Good luck!
reidalert over 3 years ago
Congrats on your launch, it looks really exciting! I'm curious how this compares to tools like Snowplow [1]? I guess Jitsu comes with more sources and destinations out of the box?

[1] https://github.com/snowplow/snowplow
santiagobasulto over 3 years ago
I really like this, congratulations on the launch. And this is such a huge space that there's definitely room for other options (aside from Segment).

I'm a little bit out of the loop in this event processing space. Do you think Jitsu could replace lower-level event processing implementations such as Kafka/Kinesis? Or is it meant for more "high level" marketing stuff?
public_void over 3 years ago
Hey congrats on the launch, clearly a lot of thought and effort went into this. I'm pretty new to this space, and maybe this is a dumb question but how does this differ from Mixpanel? Would I use this for something different?
polskibus over 3 years ago
How do I scale Jitsu if the load from my app servers becomes too big? Will adding more Jitsu nodes trample the database nodes that Jitsu writes to? How should I plan capacity for a Jitsu deployment in a multi-node scenario, and what should I take into consideration when scaling it?
tailspin2019 over 3 years ago
This looks really cool. I'm keen to try it.

It looks like it might play well with my current logging system of choice, Seq [0].

Do you support inbound webhooks? I can see webhooks as a destination but not as a source?

[0] https://datalust.co/seq
polskibus over 3 years ago
Are there any examples of what the resulting SQL tables look like in Postgres or ClickHouse for a given event schema? I'd like to know how generic it is per event type: is it something like (event id, blob), or does it try to decompose each event field into a column, and if so, what happens with nested objects, etc.? Knowing this would greatly improve my understanding of how reusable Jitsu is for the various event-collection tasks I may have.
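To make the second option in this question concrete, here is a small sketch of decomposing a nested event into flat column names. It illustrates the general technique only. The thread does not say how Jitsu actually maps events to tables, and `flatten_event` and the underscore separator are assumptions chosen for the example.

```python
from typing import Any, Dict


def flatten_event(event: Dict[str, Any], prefix: str = "", sep: str = "_") -> Dict[str, Any]:
    """Hypothetical helper: turn nested JSON objects into flat column -> value pairs."""
    columns: Dict[str, Any] = {}
    for key, value in event.items():
        # Join parent and child keys, e.g. user + id -> user_id.
        name = f"{prefix}{sep}{key}" if prefix else key
        if isinstance(value, dict):
            # Recurse into nested objects so each leaf becomes its own column.
            columns.update(flatten_event(value, name, sep))
        else:
            columns[name] = value
    return columns


event = {"event_id": "e1", "user": {"id": 7, "geo": {"country": "DE"}}}
print(flatten_event(event))
# {'event_id': 'e1', 'user_id': 7, 'user_geo_country': 'DE'}
```

The alternative in the question, an (event id, blob) table, would instead store the whole JSON document in a single column and leave decomposition to query time.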
laex over 3 years ago
Can we expect React Native integration any time soon?
0xferruccio over 3 years ago
Congrats on the launch, much needed! Would love to see if it's possible for you to connect to https://june.so for easy to use product analytics.

Are you following the same tracking convention spec as Segment?
einpoklum over 3 years ago
> We help companies collect events from their apps, websites, and APIs and send them to databases.

For those who don't know what "Segment" is (like me) - this Jitsu thing seems to only be relevant to web-based/web-oriented apps.
yakkomajuri over 3 years ago
This is super cool, congrats!

The website is heavy on Segment comparisons (which makes sense). However, you're not the only open source Segment alternative, so how do you view yourselves in comparison to e.g. RudderStack or Snowplow?
davidkell over 3 years ago
Many of your integrations talk about “syncing” rather than event collection, which to me sounds like what Fivetran is doing. Does that distinction make sense and how are you thinking about that?
tarun_anand over 3 years ago
Adding on to the previous comment, how does this compare to RudderStack?
slig over 3 years ago
Congrats on launching and thanks for making it easy to deploy using Docker! I'd like to suggest that you make it available as a 1-click app on DigitalOcean as well.
dominotw over 3 years ago
Is https://meltano.com/ a more general version of this?

I am wondering how this compares to it.
baur over 3 years ago
Congrats on the launch!

Just wondering, do you have any plans to support CrateDB?

It supports SQL and understands PG protocol - perhaps supporting Postgres kinda already makes it close.
shcheklein over 3 years ago
Congrats on the launch! We've started using the open source version for one of the tools we are building (CLI anonymized telemetry) and it looks good so far. Thanks for the great product. It was very easy and straightforward to get started, deploy and start collecting things (into BigQuery).

Overall, I like this recent trend a lot - more companies are building open-source, lightweight, GDPR-compatible analytics, chats (e.g. Papercups). I hope there will be good ways to monetize and sustain this. Wish you all the best, folks!
ramarnat over 3 years ago
CTO at Vertica here. How can we make sure Vertica is one of the DWH destinations in Jitsu?
adrianthedev over 3 years ago
Good job guys. Amazing work!

It was fun watching you grow Jitsu and love the way you provide support!
pjoter over 3 years ago
Do you support something similar to the ajs_aid URL param ID override in Segment?
MaxiaNN over 3 years ago
Great story. How do you feel Jitsu compares to RudderStack?
polskibus over 3 years ago
What value does the Airbyte / ELT integration provide? Surely I could just run ETLs on Airflow or similar on the tables that Jitsu generates?
ogazitt over 3 years ago
Sweet! Great to see an OSS alternative to Segment.
ajbosco over 3 years ago
Why would I use this instead of Airbyte?
okhuman over 3 years ago
I miss the eventnative name :)
hasurabd over 3 years ago
Great product! Lovely team!
ComputerGuru over 3 years ago
The black banners at the top and bottom of your website break scrolling on mobile.
browsec over 3 years ago
Great job!
ThePhysicist over 3 years ago
Great work! It's funny that YC funded Segment as well as two direct Segment competitors (RudderStack is the other one, though I think they initially started with a different idea and pivoted to that afterwards), though given the size of YC this is probably expected and Segment is probably large enough to "deserve" some good smaller competition.

As someone who also builds an open-core product (though not directly modeled after an existing closed-source product) I really hope this kind of business model will become more accepted.
finiteseries over 3 years ago
Please consider posting some form of this as a blog post as well, I love to hear about product inceptions.

HN doesn’t allow lesser users with lesser eyesight to read light grey on beige self text, unfortunately.
leftnode over 3 years ago
Congrats on the launch. Unrelated to the actual software, are any of you actual Jiu-Jitsu grapplers? If not, why'd you go with the name Jitsu?
nkotov over 3 years ago
Congrats on the launch!
xondono over 3 years ago
Kudos on the work done and I wish you the best of luck.

A little note of advice: I wouldn’t start my company description as “the Y of X” or “the Y alternative to X”.

It’s okay to mention if you are similar to another well-known company, but don’t use it to describe your company, especially not in the first line.
colesantiago over 3 years ago
> Jitsu solves the AdBlocker problem...

This line alone is enough to infuriate me. So I am unable to block spying and data collection now?

I don't understand why we are still praising spyware tools?
simplyinfinity over 3 years ago
The name is one letter off from an already existing open-source project: Jitsi
tomislavpet over 3 years ago
Congrats on the launch!

Noticed a typo on jitsu.com - DHW should probably be DWH.
soumyadeb over 3 years ago
Congrats on your launch. Great to see more innovation in this space. Segment deserves some serious competition.

-RudderStack team.