Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
gpu
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
NVIDIA 50-Series GDDR7 Rumors, Mesa 26.1 AMD APU Drivers, WebGPU 1-bit LLMs
soy
soy
soy
Follow
Apr 15
NVIDIA 50-Series GDDR7 Rumors, Mesa 26.1 AMD APU Drivers, WebGPU 1-bit LLMs
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
4 min read
LLM Auto-Tunes llama.cpp, SASS Latency Analysis, DLSS Frame Gen for RTX 40
soy
soy
soy
Follow
Apr 14
LLM Auto-Tunes llama.cpp, SASS Latency Analysis, DLSS Frame Gen for RTX 40
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
VRAMを増やせば解決する、は物理的に間違っている — HBM・CXL・Unified Memoryが取れなかったもの
plasmon
plasmon
plasmon
Follow
Apr 14
VRAMを増やせば解決する、は物理的に間違っている — HBM・CXL・Unified Memoryが取れなかったもの
#
llm
#
gpu
#
vram
Comments
Add Comment
4 min read
llama.cppの設定で8GBの性能が5倍変わる — 主要オプションの最適値を出した
plasmon
plasmon
plasmon
Follow
Apr 14
llama.cppの設定で8GBの性能が5倍変わる — 主要オプションの最適値を出した
#
llm
#
llamacpp
#
gpu
Comments
Add Comment
4 min read
One Query, Four GPUs: Tracing a Distributed Training Stall Across Nodes
Ingero Team
Ingero Team
Ingero Team
Follow
Apr 13
One Query, Four GPUs: Tracing a Distributed Training Stall Across Nodes
#
gpu
#
ebpf
#
distributedcomputing
Comments
Add Comment
7 min read
Task Manager is lying about your GPU temps. Here is how to read the real data in Python
Yaroslav Pristupa
Yaroslav Pristupa
Yaroslav Pristupa
Follow
Apr 13
Task Manager is lying about your GPU temps. Here is how to read the real data in Python
#
ai
#
hardware
#
softwaredevelopment
#
gpu
Comments
Add Comment
4 min read
AMD ML Complete Stack
compilersutra
compilersutra
compilersutra
Follow
Apr 12
AMD ML Complete Stack
#
gpu
#
cpu
#
ai
#
llm
Comments
Add Comment
1 min read
RTX 5090 cuBLAS Bug, Neural Texture Compression, Multi-GPU vLLM Inference
soy
soy
soy
Follow
Apr 11
RTX 5090 cuBLAS Bug, Neural Texture Compression, Multi-GPU vLLM Inference
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
A Complete Guide to Real-Time GPU Usage Monitoring
James Skelton
James Skelton
James Skelton
Follow
for
DigitalOcean
Apr 15
A Complete Guide to Real-Time GPU Usage Monitoring
#
gpu
#
ai
#
tutorial
#
hardware
1
reaction
Comments
Add Comment
14 min read
CUDA SGEMM Bug on RTX 5090, Kernel-Fusing for SGEMV, & Radeon RX 9070 XT Price Surge
soy
soy
soy
Follow
Apr 10
CUDA SGEMM Bug on RTX 5090, Kernel-Fusing for SGEMV, & Radeon RX 9070 XT Price Surge
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
4 min read
TGI - Text Generation Inference - Install, Config, Troubleshoot
Rost
Rost
Rost
Follow
Apr 10
TGI - Text Generation Inference - Install, Config, Troubleshoot
#
docker
#
gpu
#
observability
#
selfhosting
Comments
Add Comment
9 min read
Memory Coalescing: Same computation, 6x Performance Difference
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Apr 9
Memory Coalescing: Same computation, 6x Performance Difference
#
cuda
#
gpu
#
aiops
#
cpp
Comments
Add Comment
6 min read
LLM GPU Breakthroughs: RT Cores, Llama.cpp Parallelism, AMD Optimizations
soy
soy
soy
Follow
Apr 9
LLM GPU Breakthroughs: RT Cores, Llama.cpp Parallelism, AMD Optimizations
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster
Alan West
Alan West
Alan West
Follow
Apr 9
How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster
#
machinelearning
#
deeplearning
#
python
#
gpu
Comments
1
comment
5 min read
How K-Means Clustering Works (Explained by Extracting Colors from Images)
Francesco Di Donato
Francesco Di Donato
Francesco Di Donato
Follow
Apr 9
How K-Means Clustering Works (Explained by Extracting Colors from Images)
#
webgl
#
machinelearning
#
javascript
#
gpu
1
reaction
Comments
Add Comment
3 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account