TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Fsspec: Filesystem Interfaces for Python

129 pointsby giladalmost 4 years ago

10 comments

remramalmost 4 years ago
See also smart_open: <a href="https:&#x2F;&#x2F;github.com&#x2F;RaRe-Technologies&#x2F;smart_open" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;RaRe-Technologies&#x2F;smart_open</a> which might be more user-friendly? Never used it myself but it was on HN before. Discussion on their bugtracker: <a href="https:&#x2F;&#x2F;github.com&#x2F;RaRe-Technologies&#x2F;smart_open&#x2F;issues&#x2F;579" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;RaRe-Technologies&#x2F;smart_open&#x2F;issues&#x2F;579</a><p>Personally I wasn&#x27;t too impressed with fsspec, though I am using it. I had to wrap files with Python&#x27;s io wrappers to get acceptable performance, and found that different fsspec implementations still have significant differences that you can&#x27;t ignore. They don&#x27;t seem interested in supporting or documenting use cases outside of Pandas and Dask.
评论 #27528306 未加载
评论 #27525159 未加载
评论 #27526119 未加载
MrPowersalmost 4 years ago
fsspec is used in dask (<a href="https:&#x2F;&#x2F;github.com&#x2F;dask&#x2F;dask" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;dask&#x2F;dask</a>) if you want to see it in action.<p>The author of fsspec also created fastparquet (<a href="https:&#x2F;&#x2F;github.com&#x2F;dask&#x2F;fastparquet" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;dask&#x2F;fastparquet</a>), a native Python implementation of the Parquet file format.<p>I&#x27;m really appreciative for Martin&#x27;s extensive contributions to the PyData ecosystem.
评论 #27525879 未加载
dragonshalmost 4 years ago
How does it compare to pyfilesystems [1]. In our startup using pyfilesystems as a generic interface for our webapp which makes it cloud agnostic and the app can use amazon, gcs, NFS, gzip, tar, azure or regular file system.<p>PyFilesystem is a Python module that provides a common interface to any filesystem. It supports many filesystems [2].<p>[1] <a href="https:&#x2F;&#x2F;www.pyfilesystem.org&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.pyfilesystem.org&#x2F;</a><p>[2] <a href="https:&#x2F;&#x2F;www.pyfilesystem.org&#x2F;page&#x2F;index-of-filesystems&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.pyfilesystem.org&#x2F;page&#x2F;index-of-filesystems&#x2F;</a>
评论 #27523901 未加载
shchekleinalmost 4 years ago
Our team has been migrating DVC to fsspec. We&#x27;ve even started working on creating fsspec compatible wrappers for SSH, Alibaba cloud, etc.<p>There were challenges indeed, and some storages will require more work (e.g. things like GDrive) but I would say maintainers are responsive and helpful.
chaz6almost 4 years ago
One feature of a filesystem library I would like is to convert a filename into one that only includes permitted characters for a given filesystem type (for example, &quot;:&quot; and &quot;?&quot; are allowed in xfs but not exfat). For instance, mystring = fs.util.convert_name(u&quot;What time is it?.txt&quot;, fs.fstype.exfat, &quot;_&quot;) where &quot;_&quot; is the character to be substituted.
评论 #27528762 未加载
piqufohalmost 4 years ago
Just a heads up - fsspec has an (optional) dependency on s3fs which has a requirement on aiobotocore, which in turn is currently locked to an ancient boto3 version. Versioning fsspec[s3fs] in projects with other boto3 dependencies is a nightmare.
评论 #27528150 未加载
epylaralmost 4 years ago
If there is an in-memory filesystem library for this, it could be useful for unit testing things that want to write files.
评论 #27528183 未加载
评论 #27525249 未加载
nivekkevinalmost 4 years ago
It&#x27;s used in pandas and I love it!
rahimialialmost 4 years ago
it&#x27;s super convenient, but watch out for bugs and performance issues. it&#x27;s caching mechanism, how it uploads to S3, and how it seeks unseekable files conceal pretty big performance bottlenecks.
评论 #27528194 未加载
rektidealmost 4 years ago
this is what the web&#x27;s upcoming File System Access[1] really enables, but stealtily so. it&#x27;s advertised as an implementation, as the capabilities to interact with the filesystem. but it&#x27;s also an interface too. JS having a interface for filesystems is going to be extremely great.<p>[1] <a href="https:&#x2F;&#x2F;wicg.github.io&#x2F;file-system-access&#x2F;" rel="nofollow">https:&#x2F;&#x2F;wicg.github.io&#x2F;file-system-access&#x2F;</a>