TechEcho

7 comments

jpxwabout 5 years ago

Something I love about pandas is that often you can pass a URL in place of a file name.The other day I needed to scrape data from a table on a webpage. Thinking about traversing the DOM and building up an array was already giving me a headache. Thankfully pandas has the “read_html” function. Getting a list of dataframes for each table on the page was as easy as:<pre><code> dfs = pd.read_html(url)</code></pre>

评论 #22544896 未加载

评论 #22546376 未加载

评论 #22544916 未加载

aksakalliabout 5 years ago

Medium wants me to upgrade my account to read this article, please people share your posts in somewhere else.

评论 #22544946 未加载

评论 #22544961 未加载

评论 #22544912 未加载

andreareinaabout 5 years ago

Merge with indicator is also useful for doing anti-joins:<pre><code> left.merge(right, how="left", indicator=True, ...) [lambda df: df._merge == "left_only"]</code></pre>

staticautomaticabout 5 years ago

My favorite, most elegant SO answer I've ever gotten was to a question about Pandas.The question was "How do I create a column where each row's value is the mean of another column's values starting at that row?" The answer was:<pre><code> df.loc[::-1, 'col_1'].expanding().mean()[::-1]</code></pre>

评论 #22550193 未加载

评论 #22552058 未加载

closedabout 5 years ago

Note that there is a handy PeriodIndex version of pd.date_range:<pre><code> pd.period_range(date_from, date_to, freq = "D") </code></pre> AFAICT, a PeriodIndex and DateTimeIndex function mostly the same, and have many of the same methods, except...<pre><code> * DateTimeIndex can't hold dates far in the future * PeriodIndex can't easily round to the end of a period (e.g. date + 0*MonthEnd() errors) * PeriodIndex doesn't handle timezones?</code></pre>

HIP_HOPabout 5 years ago

TLDR;5 lesser-known pandas tricks:1. Date Ranges2. Merge with indicator3. Nearest merge by timestamp4. Create an Excel report from pandas5. Use gzip with when saving to csv

collywabout 5 years ago

Does anyone want to do a TLDR? I don't especially want to sign into Medium.

评论 #22544910 未加载

7 comments

jpxwabout 5 years ago

评论 #22544896 未加载

评论 #22546376 未加载

评论 #22544916 未加载

aksakalliabout 5 years ago

Medium wants me to upgrade my account to read this article, please people share your posts in somewhere else.

评论 #22544946 未加载

评论 #22544961 未加载

评论 #22544912 未加载

andreareinaabout 5 years ago

Merge with indicator is also useful for doing anti-joins:<pre><code> left.merge(right, how="left", indicator=True, ...) [lambda df: df._merge == "left_only"]</code></pre>

staticautomaticabout 5 years ago

评论 #22550193 未加载

评论 #22552058 未加载

closedabout 5 years ago

HIP_HOPabout 5 years ago

TLDR;5 lesser-known pandas tricks:1. Date Ranges2. Merge with indicator3. Nearest merge by timestamp4. Create an Excel report from pandas5. Use gzip with when saving to csv

collywabout 5 years ago

Does anyone want to do a TLDR? I don't especially want to sign into Medium.

评论 #22544910 未加载

Lesser-known Pandas tricks (2019)

7 comments

Lesser-known Pandas tricks (2019)

7 comments