The thing I'm looking forward to most is having Flash Attention built in. Right now you have to use xformers or similar, and that dependency has been a nightmare: it keeps breaking, it requires specific concoctions of dependencies or else conda will barf, and it's impossible to pin because I have to use -dev releases, which they constantly drop from the repositories.

PyTorch 2.0 comes with a few different efficient transformer implementations built in. And unlike in 1.13, they work during training and don't require specific configurations. They seemed to work just fine during my pre-release testing. Also, having it built into PyTorch might mean more pressure to keep it optimized; as-is, xformers targets the A100 primarily, with other archs as an afterthought.

And, as promised, `torch.compile` worked out of the box, providing IIRC a nice ~20% speedup on a ViT without any other tuning.

I did have to do some dependency fiddling on the pre-release version. I've been looking forward to the "stable" release before using it more extensively.

Anyone else seeing nice boosts from `torch.compile`?
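For anyone who hasn't tried the new APIs yet, here's roughly the shape of them. A minimal sketch — the shapes, dtypes, and the torchvision ViT are just illustrative, not anything from the release notes:

```python
import torch
import torch.nn.functional as F
import torchvision

# Built-in fused attention: on supported GPUs (fp16/bf16, head_dim <= 128)
# this dispatches to a Flash Attention kernel, no xformers required.
q = torch.randn(8, 16, 1024, 64, device="cuda", dtype=torch.float16)
k = torch.randn(8, 16, 1024, 64, device="cuda", dtype=torch.float16)
v = torch.randn(8, 16, 1024, 64, device="cuda", dtype=torch.float16)
out = F.scaled_dot_product_attention(q, k, v)

# You can force the Flash backend to verify it's actually being picked
# (it raises if the inputs aren't eligible):
with torch.backends.cuda.sdp_kernel(
    enable_flash=True, enable_math=False, enable_mem_efficient=False
):
    out = F.scaled_dot_product_attention(q, k, v)

# And torch.compile is a one-liner on top of any nn.Module.
model = torch.compile(torchvision.models.vit_b_16().cuda())
imgs = torch.randn(4, 3, 224, 224, device="cuda")
preds = model(imgs)  # first call is slow (compilation); later calls are fast
```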
> Python 3.11 support on Anaconda Platform

> Due to lack of Python 3.11 support for packages that PyTorch depends on, including NumPy, SciPy, SymPy, Pillow and others on the Anaconda platform, we will not be releasing Conda binaries compiled with Python 3.11 for PyTorch Release 2.0. The Pip packages with Python 3.11 support will be released, hence if you intend to use PyTorch 2.0 with Python 3.11 please use our Pip packages.

It really sucks that Anaconda always lags behind. I know the reasoning*, and I know it makes sense for what a lot of teams use it for... but on our side we are now looking more and more into dropping it, since we are more of an R&D team. We already use containers for most of our pipelines, so just using pip might be viable.

*Though I guess Anaconda bit off more than it can chew w.r.t. managing an entire Python universe and keeping it up to date. Conda-forge is already almost a requirement, but using the official package (with pip, in this case) has its own benefits for very complex packages like PyTorch.
I'm hoping `torch.compile` is a gateway to "easy" non-Nvidia accelerator support in PyTorch.

Also, I have been using `torch.compile` for the Stable Diffusion unet/vae since February, to good effect. I'm guessing similar optimizations will pop up for LLaMA.
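In case it's useful: the pattern is just swapping compiled submodules into the pipeline. A rough sketch using the diffusers library (the checkpoint name is only an example, and compiling `vae.decode` is the usual trick since that's the hot path on the VAE side):

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Compile only the hot submodules; the rest of the pipeline stays eager.
# The first generation is slow (compilation); subsequent ones get the speedup.
pipe.unet = torch.compile(pipe.unet)
pipe.vae.decode = torch.compile(pipe.vae.decode)

image = pipe("an astronaut riding a horse").images[0]
```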
> 100% backward compatible

That's (for me) the biggest reason why TensorFlow fell out of favor: the API broke too often (not just between TF 1 and 2).
If anyone can edit it, I found a typo:

> Python 1.8 (deprecating Python 1.7)

> Deprecation of Cuda 11.6 and Python 1.7 support for PyTorch 2.0

It's clearly supposed to be Python 3.8 and 3.7, respectively.
> As an underpinning technology of torch.compile, TorchInductor with Nvidia and AMD GPUs will rely on OpenAI Triton deep learning compiler to generate performant code and hide low level hardware details. OpenAI Triton-generated kernels achieve performance that’s on par with hand-written kernels and specialized cuda libraries such as cublas.
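For anyone curious what Triton actually looks like: it's a Python-embedded DSL for GPU kernels. Below is the classic elementwise-add kernel in the style of the Triton tutorials — hand-written to show the flavor, not actual Inductor output:

```python
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # Each program instance handles one BLOCK_SIZE-wide chunk of the tensors.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements  # guard the ragged tail of the last block
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

x = torch.randn(4096, device="cuda")
y = torch.randn(4096, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)  # number of program instances to launch
add_kernel[grid](x, y, out, x.numel(), BLOCK_SIZE=1024)
```

If I remember right, you can also dump the Triton kernels Inductor actually generates for your model by setting the TORCH_COMPILE_DEBUG=1 environment variable before running a compiled model.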
discussion from (presumably) the PyTorch Conference announcement: https://news.ycombinator.com/item?id=33832511