Replibyte – Seed your database with real data

222 points by evoxmusic, almost 3 years ago

6 comments

ff7c11, almost 3 years ago
Trying to think how to anonymise datetimes hurts my head. You might want to randomise the date of an event. But you also need this random date to be consistent with respect to both the current time and the order of other related rows in the database.
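One possible way to approach the consistency problem described above is to shift every timestamp that belongs to the same entity by the same secret, deterministic offset into the past. This is only a sketch under stated assumptions, not something Replibyte is known to do: `SECRET`, `MAX_SHIFT_DAYS`, `shift_for`, and `anonymise_ts` are hypothetical names chosen for illustration. Within one entity, the order of events and the gaps between them are preserved, and because the shift is always backwards, no shifted date lands in the future. Ordering across different entities is not preserved.

```python
import hashlib
import hmac
from datetime import datetime, timedelta

# Hypothetical values for illustration; a real key would be kept out of source control.
SECRET = b"hypothetical-secret-key"
MAX_SHIFT_DAYS = 30


def shift_for(entity_id: str) -> timedelta:
    """Derive a fixed backwards shift for one entity (e.g. one user)."""
    digest = hmac.new(SECRET, entity_id.encode("utf-8"), hashlib.sha256).digest()
    # Map the first 8 bytes of the digest onto 1 .. MAX_SHIFT_DAYS days, in seconds.
    seconds = int.from_bytes(digest[:8], "big") % (MAX_SHIFT_DAYS * 86400)
    return timedelta(seconds=seconds + 1)


def anonymise_ts(entity_id: str, ts: datetime) -> datetime:
    """Shift a timestamp into the past by the entity's fixed offset."""
    return ts - shift_for(entity_id)


# Two related events for the same (made-up) user keep their order and spacing.
signup = datetime(2022, 7, 10, 12, 0, 0)
first_order = datetime(2022, 7, 11, 8, 30, 0)
masked_signup = anonymise_ts("user-42", signup)
masked_order = anonymise_ts("user-42", first_order)
print(masked_signup < masked_order)
print(masked_order - masked_signup == first_order - signup)
```

The trade-off is that intervals between events are left untouched, so this hides absolute dates but not the shape of a user's activity.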
bennyp101, almost 3 years ago
How does it keep personal data safe? I had a look at “How it works” and “FAQs”, but they don’t explain how you keep the data safe. It also gets uploaded to S3?

I might have missed it, but I need to know exactly where our PII is stored (so not on a dev laptop). How do you know what to replace, and what do you do with the info you do replace?

Edit: To answer my own question: via transformers. But that seems to suggest each dev has to keep them up to date with any schema changes, etc.

(Also, some links are broken on GitHub.)
roskilli, almost 3 years ago
One feature I’d love to see is a transformer that, instead of providing a random value, provides a cryptographic one-way hash of the data (e.g. SHA-2). That way key uniqueness stays the same (so unique constraints on columns are not violated), and the same value used in one place will still match the corresponding value in another table after transformation, which more accurately reflects the “shape” of the data.
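A minimal sketch of that idea, written as standalone Python rather than as a Replibyte transformer (this is not Replibyte's API). It swaps plain SHA-2 for HMAC-SHA256, a keyed variant, because an unkeyed hash of low-entropy values such as email addresses can be reversed with a dictionary attack. `SECRET`, `pseudonymise`, and the sample tables are hypothetical and exist only for illustration.

```python
import hashlib
import hmac

# Hypothetical key for illustration; a real key would be kept out of source control.
SECRET = b"hypothetical-secret-key"


def pseudonymise(value: str) -> str:
    """Return a deterministic, keyed one-way hash of the input as 64 hex characters."""
    return hmac.new(SECRET, value.encode("utf-8"), hashlib.sha256).hexdigest()


# Made-up rows from two tables that reference the same email address.
users = [
    {"id": 1, "email": "alice@example.com"},
    {"id": 2, "email": "bob@example.com"},
]
orders = [{"order_id": 10, "customer_email": "alice@example.com"}]

masked_users = [{**u, "email": pseudonymise(u["email"])} for u in users]
masked_orders = [
    {**o, "customer_email": pseudonymise(o["customer_email"])} for o in orders
]

# Equal inputs map to equal outputs, so the cross-table match survives;
# distinct inputs map to distinct outputs, so a unique constraint still holds.
print(masked_users[0]["email"] == masked_orders[0]["customer_email"])
print(masked_users[0]["email"] != masked_users[1]["email"])
```

The output no longer looks like an email address and is 64 characters long, so columns with format checks or narrow widths would need extra handling.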
zX41ZdbW, almost 3 years ago
I recommend checking out clickhouse-obfuscator. It's a more sophisticated tool for dataset obfuscation.

Installation (single binary, Linux/Mac/FreeBSD):

    curl https://clickhouse.com/ | sh
    ./clickhouse obfuscator --help

Docs: https://clickhouse.com/docs/en/operations/utilities/clickhouse-obfuscator/
dopidopHN, almost 3 years ago
The default seems to be to store the sanitized dump on S3.

S3 is not always available in a professional context, and uploading there might be considered data extraction.

Keeping everything local and detailing exactly what goes where and how would be helpful.
CSSer, almost 3 years ago
I think the description in the man entry is better than the one in the README. Other than that, cool tool!