15 points | by eigenBasis 5 days ago
2 comments
Aren't there other options for custom PyTorch operators before going as low-level as CUDA C++, like JAX or CuPy?
Neat. How do you add PT2 (torch.compile) support, and how do you let Inductor fuse your custom ops with normal ops?