Is anybody storing non-text files in S3 for their software? I'm referring to PDFs, video, audio, images, etc.<p>If so, it would mean the world if I could talk to you and ask about your experience uploading and retrieving these files within your software.<p>To give context: I have a free managed API that offers full-text search over your S3 bucket, which you can integrate into your software. The website is https://mixpeek.com<p>Let me know if this describes you and if you're willing to answer some quick questions in exchange for being involved in the free pilot :)
What you describe is part of the AWS "Well-Architected Framework". Static files should, as much as possible, be hosted on S3. This takes the pressure off the web/app server, is cheaper, increases resilience to DDoS attacks, and allows much faster loading, particularly of video files, when the S3 storage is integrated with a CDN (such as CloudFront).<p>What I would find interesting is anyone running a SaaS project and NOT using S3/object storage for a significant share of their files.
I think there's a typo in the code example for the Python client that would result in a NameError if run:<p><pre><code> from mixpeek import mixpeek
</code></pre>
It should be:<p><pre><code> from mixpeek import Mixpeek
</code></pre>
since that is the class instantiated below.<p>One point in the Healthcare and Insurance use case reads as follows:<p>><i>Ensure the are up to date Insurance Documents</i><p>Do you mean "Keep Insurance Documents up to date"?<p>We use S3 for <a href="https://iko.ai" rel="nofollow">https://iko.ai</a> and give notebook users a filesystem-like experience. They can 'cd', 'ls', etc. on S3 buckets right from a Jupyter notebook.<p>We integrate with external buckets, private or public, and also use an S3-compatible object store (MinIO), for which I wrote this library: <a href="https://pypi.org/project/bmc/" rel="nofollow">https://pypi.org/project/bmc/</a><p>One thing I think will hinder your product's adoption is the "/upload" route. If I were to use your tool, I'd want to use it to <i>search</i> through objects that already exist in my bucket, not to upload to the bucket.<p>We faced the same objection, so we added the ability to mount S3 buckets, because people had datasets of 1TB or so in existing buckets.
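<p>For anyone wondering how 'cd' and 'ls' can make sense on a flat object store, here is a minimal sketch of the idea. It is not the iko.ai or bmc implementation; the function names and the sample keys are made up for illustration, and it works on an in-memory list of keys rather than a real bucket.

```python
from typing import Iterable, List


def ls(keys: Iterable[str], cwd: str = "") -> List[str]:
    """List the immediate children of `cwd`, treating "/" in flat keys as a separator."""
    prefix = cwd.strip("/")
    prefix = prefix + "/" if prefix else ""
    entries = set()
    for key in keys:
        if not key.startswith(prefix):
            continue
        rest = key[len(prefix):]
        if not rest:
            continue  # the key is the "directory" marker itself
        head, sep, _ = rest.partition("/")
        entries.add(head + sep)  # a trailing "/" marks a "directory"
    return sorted(entries)


def cd(cwd: str, target: str) -> str:
    """Resolve `target` against `cwd` roughly the way a shell would; returns the new prefix."""
    parts = [] if target.startswith("/") else [p for p in cwd.split("/") if p]
    for part in target.split("/"):
        if part in ("", "."):
            continue
        if part == "..":
            if parts:
                parts.pop()
        else:
            parts.append(part)
    return "/".join(parts)


# Hypothetical keys standing in for the contents of an existing bucket.
KEYS = [
    "datasets/train/images/001.png",
    "datasets/train/labels.csv",
    "datasets/test/labels.csv",
    "reports/2021/summary.pdf",
    "README.txt",
]

cwd = cd("", "datasets/train")
print(ls(KEYS, cwd))
```

<p>Against a real bucket you would not pull every key client-side: the S3 ListObjectsV2 call accepts Prefix and Delimiter parameters and returns the "directories" as common prefixes, so the same grouping can happen server-side.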