Shangren Lu

CuPerf

2026-01-13

C++23 CLI tool for NVIDIA GPU benchmarking with extensible architecture.

Filed under: project · cuda · gpu · benchmarking

Contents

  1. Features
  2. Technologies
  3. Links
  4. Status
  5. See Also

CuPerf is a C++23 command-line tool for benchmarking NVIDIA GPUs (memory, compute, and tensor cores), with an extensible architecture and JSON/CSV output.

Features

  • Memory bandwidth, compute throughput, and tensor core benchmarking
  • Kernel launch overhead and reduction performance measurements
  • Multiple data types (FP32, FP16, BF16, INT8, FP4)
  • Extensible architecture
  • Multiple output formats (console, JSON, CSV)
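
To make the "extensible architecture" and "multiple output formats" items concrete, here is a minimal host-side sketch of one way such a design could look. It is not CuPerf's actual code: `Result`, `Registry`, `bandwidth_gbps`, `to_csv`, and `to_json` are hypothetical names chosen for illustration, and the timing value is supplied by the caller rather than measured on a GPU.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

// Hypothetical result record; the field names are illustrative only.
struct Result {
    std::string name;
    std::string unit;
    double value;
};

// Effective bandwidth in GB/s (10^9 bytes per second) from the number of
// bytes moved and the elapsed time in seconds.
inline double bandwidth_gbps(double bytes, double seconds) {
    assert(seconds > 0.0);
    return bytes / seconds / 1e9;
}

// Hypothetical registry: a new benchmark is added by registering a callable,
// so existing benchmarks and formatters do not need to change.
class Registry {
public:
    using Fn = std::function<Result()>;

    void add(Fn fn) { benchmarks_.push_back(std::move(fn)); }

    std::vector<Result> run_all() const {
        std::vector<Result> out;
        for (const auto& b : benchmarks_) out.push_back(b());
        return out;
    }

private:
    std::vector<Fn> benchmarks_;
};

// CSV formatter. Assumes names and units contain no commas or quotes.
inline std::string to_csv(const std::vector<Result>& rs) {
    std::ostringstream os;
    os << "name,unit,value\n";
    for (const auto& r : rs) {
        os << r.name << ',' << r.unit << ',' << r.value << '\n';
    }
    return os.str();
}

// JSON formatter. Assumes names and units need no string escaping.
inline std::string to_json(const std::vector<Result>& rs) {
    std::ostringstream os;
    os << '[';
    for (std::size_t i = 0; i < rs.size(); ++i) {
        if (i > 0) os << ',';
        os << "{\"name\":\"" << rs[i].name << "\",\"unit\":\"" << rs[i].unit
           << "\",\"value\":" << rs[i].value << '}';
    }
    os << ']';
    return os.str();
}
```

In this sketch, registering a lambda that returns `Result{"memcpy_d2d", "GB/s", bandwidth_gbps(2e9, 0.5)}` and passing `run_all()` to `to_csv` yields the header line followed by `memcpy_d2d,GB/s,4`. Keeping the formatters separate from the registry means a new output format is just one more function over the same `Result` vector.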

Technologies

CUDA, C++23, Parallel Computing, Profiling

Links

  • GitHub Repository

Status

Active development (2025 to present)

See Also

  • Resume - See this project in my resume
  • Projects - All my projects
