TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Dplython: Dplyr for Python

61 pointsby kiechuabout 9 years ago

6 comments

carljvabout 9 years ago
This is an interesting project, and it&#x27;s illuminating to see what it takes to emulate some R features in Python (custom infix ops, non-standard evaluation, dataframes as namespaces&#x2F;envrionments, etc.)<p>But I feel like it would be better to use method chaining for the piping of transformation rather than overloading dunder method operators. It would preserve one of the nice things about dplyr -- composing complicated transformations from a simple vocabulary, but more pythonic. This is a relative weakness I see in the design of pandas and would love to see ported over.<p>But also, dplyr is a thing that really goes beyond pandas. It&#x27;s really an elegant, SQL-like DSL for transforming (mostly) arbitrary data. In this way it&#x27;s more like LINQ than a specific implementation&#x2F;API of a data structure.
CurtHagenlocherabout 9 years ago
It looks like both Dplython and pandas-ply are missing one of (what I think is) the core value propositions of dplyr: the ability to use the same abstractions on local data and on remote data, with execution against the remote source happening lazily such that the entire table doesn&#x27;t need to be downloaded in order to run a filter locally.<p>(Of course, I may be biased in that I work on a commercial product which also has this characteristic.)
评论 #11336283 未加载
评论 #11337194 未加载
评论 #11336356 未加载
baldfatabout 9 years ago
Piping in R is actually from <a href="https:&#x2F;&#x2F;github.com&#x2F;smbache&#x2F;magrittr" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;smbache&#x2F;magrittr</a> and not Dplyr and is actually inspired from F#.<p>&quot;R package to bring forward-piping features ala F#&#x27;s |&gt; operator.&quot;
评论 #11336325 未加载
dandermotjabout 9 years ago
Looks like Hadleyverse is spreading! Honestly, I think his contributions have led the way for R to being the leading language in data science.
staredabout 9 years ago
I was not aware it is possible with Python; but now I see: <a href="http:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;33658355&#x2F;piping-output-from-one-function-to-another-using-python-infix-syntax" rel="nofollow">http:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;33658355&#x2F;piping-output-fr...</a>
lamecicleabout 9 years ago
I&#x27;m hugely excited by this.