TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: From Batch to Streaming architecture using a Graph is this a good idea?

1 点作者 homosaphien大约 8 年前
Trying to query users from a TB scale cloud table. Based on thousands of different segments and generating a daily feed for each of these segments. At the end of every day we query the table to say for e.g. Create a subset of all users living in USA and generate a file for it.<p>We use to query the database every night but the number of segments have become so huge that its no longer feasible to batch query this data every night and populate segment feed. I am thinking of moving to Streaming architecture where users are assigned to the segment as they come in. For that I am looking to load segment definition into a graph data structure and determine which user qualifies for which segment. Any ideas if a graph will be appropriate. Vertices will contain filters e.g. country:USA, and segment_id. And edges will represent AND case between filters.<p>I do realize Amazon Kinesis has a continuous query model on its stream. Due to some condition&#x2F;limitations I am limited to Google cloud for this use case. I will be attempting to do this in Google cloud dataflow pipeline.<p>Any critique&#x2F;suggestion will be appreciated.

暂无评论

暂无评论