TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Learning PySpark and Related Tools

9 pointsby rookie1233 months ago
Hey HN,<p>I have been working in the data-science and machine-learning domain for the past 8 years or so. I have not been exposed to tools such as PySpark etc. which are being asked frequently in job descriptions. What resource or certification can I use to get upto par on PySpark?<p>Thanks!

6 comments

almosthere3 months ago
Having used spark for the past 8 years or so, it&#x27;s definitely a solid basic for data engineering. I use it for generating reports the most, but sometimes we have large projects to get data into different staging databases. I use it a lot with ElasticSearch or a parquet. Basically it helps you write large joins and flatten the result to a database that can more quickly perform aggregations on that flattened result (like Elasticsearch) or a columnar database.
datadrivenangel3 months ago
If you have experience in any data frame library (like Pandas), and SQL, you can pick up PySpark pretty easily... With the one caveat that writing good data pipelines in any language gets much harder when you start looking at ways to actually processes big data (~20+TB). Modern SQL engines are so good though.
philomath_mn3 months ago
I thought the _Learning Spark_ book was a pretty good introduction. Databricks offers it for free here [0]<p>[0] <a href="https:&#x2F;&#x2F;pages.databricks.com&#x2F;rs&#x2F;094-YMS-629&#x2F;images&#x2F;LearningSpark2.0.pdf" rel="nofollow">https:&#x2F;&#x2F;pages.databricks.com&#x2F;rs&#x2F;094-YMS-629&#x2F;images&#x2F;LearningS...</a>
francocalvo3 months ago
I&#x27;m a Data Engineer which uses Spark daily. I guess the only important cert would come from Databricks, but I think it will be more worth your while to read the book mentioned here and try to do a little project ingesting&#x2F;transforming data
hnthrowaway03153 months ago
Just get a job since you are already senior. You can learn it on the job. Find a few tutorials if you must, but people should be able to pick it up in a few weeks for basic work.
rookie1233 months ago
Bump!