Hello HN! I've been trying out some AMD GPUs for Gen AI workloads lately and found the state of monitoring quite unsatisfying.
The best option out there, `nvtop`, has some very strict correctness asserts that crash it too often. With `picomon` I trade off some of that accuracy for increased reliability.
If your `amd-smi` version is incompatible, please open an issue and I'll be happy to fix that. Happy hacking!