Jetbrains has one specifically for this: DataSpell - <a href="https://www.jetbrains.com/dataspell/" rel="nofollow">https://www.jetbrains.com/dataspell/</a>.<p>It's like a tweaked version of DataGrip + PyCharm, but catering specifically to the particular needs of data scientists.
Microsoft has a fork of VS Code called Azure Data Studio. It's made for DB queries and notebooks: <a href="https://github.com/Microsoft/azuredatastudio" rel="nofollow">https://github.com/Microsoft/azuredatastudio</a>
Hey, I'm Johannes - one of the maintainers of refinery. Thanks Jonathan for sharing!!<p>Would be super excited if you guys have any feedback. It's nowhere near perfect yet, but you can already use it to build some great data-centric use cases. Amongst others for sentiment analysis, conversational AI or finetuning of your embeddings (which you can check out here: <a href="https://github.com/code-kern-ai/refinery-sample-projects" rel="nofollow">https://github.com/code-kern-ai/refinery-sample-projects</a>).<p>Let me know what you think :)
The title should probably reflect that this is specifically for managing NLP labeling tasks. It looks like a great project! Years ago I bought a book on data prep and labeling for NLP, and based on that book this project looks like it covers the main workflows you would need.
In case you're looking for a VS Code extension to quickly preview, filter and plot data from various file formats you can check out vscode-data-preview [0]<p>[0]: <a href="https://marketplace.visualstudio.com/items?itemName=RandomFractalsInc.vscode-data-preview" rel="nofollow">https://marketplace.visualstudio.com/items?itemName=RandomFr...</a>
This actually seems like a major leap forward in a really underloved space. Congratulations on your release.<p>If anyone is remotely interested in data-labelling/exploration, I would definitely recommend checking this out, it has some really exciting features, for example, built-in zero shot classification for heuristics/baselines: <a href="https://docs.kern.ai/docs/building-zero-shot-classifiers" rel="nofollow">https://docs.kern.ai/docs/building-zero-shot-classifiers</a><p>I'm also really impressed with the architecture! Very neat.<p>Not affiliated with the project, just very pleased to see something like this as an open source release.
I'm fond of Visidata, not an quite IDE but a good start<p><a href="https://www.visidata.org/" rel="nofollow">https://www.visidata.org/</a>
As I work mostly on MSSQL Server, the sweet spot for me is SSMS with RedGate SQL Prompt.<p>For non-MSSQL things, it's usually DBeaver.<p>No need for a new IDE for me.
Kind of hard to imagine a VS Code sibling for the whole data centric eco system. Maybe something like a base platform with multiple extension points for different tasks and the ability for others to extend the platform? (so like extensions in VS Code)
Hey, you mention that it is open-source but I cannot actually see the source code in that repo.<p>EDIT: I can some sources in other repo of the same org, for instance: <a href="https://github.com/code-kern-ai/refinery-ui" rel="nofollow">https://github.com/code-kern-ai/refinery-ui</a> so it's just a matter of making it easy for dev to navigate the code.