
Posts by DragonXI Development fellow

NVIDIA Isaac Platform Develop, train, simulate, deploy, operate, and optimize AI robot systems.

developer.nvidia.com/isaac

3 hours ago 0 0 0 0
cuTile Python
docs.nvidia.com/cuda/cutile-...

9 hours ago 1 0 0 0

Abstraction
#cuTile abstracts the complexities of the hardware, but #nvcc remains a critical part of the backend toolkit that #CUDA-Toolkit-13.1 uses to launch that work onto individual threads.

11 hours ago 0 0 0 0

Dependency
To run cuTile Python, the environment typically requires #nvidia-cuda-nvcc-package to be installed alongside other components like #tileiras (the Tile IR compiler).
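
A quick way to see which of these pieces are visible in a given environment is sketched below. It is only a rough check under stated assumptions: that the tools are on PATH under the executable names `nvcc` and `tileiras`, and that the Python package is importable as `cuda.tile` (the module name used elsewhere in this feed). The function name `check_cutile_environment` is made up for illustration.

```python
import importlib.util
import shutil


def check_cutile_environment():
    """Report which components named in the post are visible here."""

    def has_module(name):
        try:
            return importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # A missing parent package (no "cuda" package at all) lands here.
            return False

    return {
        # Assumption: both tools are installed under these executable names.
        "nvcc on PATH": shutil.which("nvcc") is not None,
        "tileiras on PATH": shutil.which("tileiras") is not None,
        # Assumption: the cuTile Python module is importable as cuda.tile.
        "cuda.tile importable": has_module("cuda.tile"),
    }


if __name__ == "__main__":
    for item, found in check_cutile_environment().items():
        print(f"{item}: {'found' if found else 'missing'}")
```

A "missing" entry only means the component was not found by this lookup; it may still be installed somewhere outside PATH or outside the active Python environment.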

11 hours ago 0 0 0 0

#Role-of-nvcc in cuTile-Python
Compilation Driver: it manages the process of turning CUDA-related code into fatbins or #machine-ready-code-for-GPU.

11 hours ago 0 0 0 0

In this ecosystem, nvcc serves as the underlying #compiler-driver that handles compilation of CUDA code into executable GPU instructions.

11 hours ago 0 0 0 0

¿
While cuTile allows you to write
GPU kernels in Python,
it functions as a
#domain-specific-language(DSL)
that eventually translates code
into a machine representation.

11 hours ago 1 0 0 0

In the cuTile-Python tile-based programming model, nvcc stands for #NVIDIA-CUDA-Compiler.

11 hours ago 0 0 0 0

What is the #nvcc abbreviation in the #cuTile-Python tile-based programming model?

11 hours ago 0 0 0 0
If this page does not refresh automatically, then please direct your browser to our latest docs.

nvidia.github.io/cuda-python/

22 hours ago 1 0 0 0
Simplify GPU Programming with NVIDIA CUDA Tile in Python | NVIDIA Technical Blog The release of NVIDIA CUDA 13.1 introduces tile-based programming for GPUs, making it one of the most fundamental additions to GPU programming since CUDA was invented. Writing GPU tile kernels enables...

developer.nvidia.com/blog/simplif...

22 hours ago 0 0 0 0
cuda-tile CUDA Tile Compiler

pypi.org/project/cuda...

22 hours ago 0 0 0 0
1. Introduction — NVIDIA CUDA Compiler Driver 13.2 documentation The documentation for nvcc, the CUDA compiler driver.

docs.nvidia.com/cuda/cuda-co...

22 hours ago 0 0 0 0
2.5. NVCC: The NVIDIA CUDA Compiler — CUDA Programming Guide

docs.nvidia.com/cuda/cuda-pr...

22 hours ago 0 0 0 0
Releases · NVIDIA/cuda-tile CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te...

github.com/NVIDIA/cuda-...

22 hours ago 0 0 0 0
CUDA 13.2 Introduces Enhanced CUDA Tile Support and New Python Features | NVIDIA Technical Blog CUDA 13.2 arrives with a major update: NVIDIA CUDA Tile is now supported on devices of compute capability 8.X architectures (NVIDIA Ampere and NVIDIA Ada), as well as 10.X, 11.X and 12.

developer.nvidia.com/blog/cuda-13...

22 hours ago 0 0 0 0

JIT Compilation
To JIT-compile at launch time in Python, you can skip the manual tileiras step and let the cuda.tile runtime handle translation of .tilebc files directly.

22 hours ago 0 0 0 0

Targeting Architectures
CUDA 13.2 expanded tile support beyond Blackwell (10.x, 12.x) to include Ampere and Ada Lovelace (8.x).
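
The list above can be turned into a small lookup, sketched here. The table reflects only the compute-capability families named in this post, not an authoritative support matrix, and the names `TILE_FAMILIES_PER_POST` and `tile_family` are made up for illustration.

```python
# Compute-capability major versions named in the post.
# Assumption: this mirrors the post only; check NVIDIA's docs for the real matrix.
TILE_FAMILIES_PER_POST = {
    8: "Ampere / Ada Lovelace",
    10: "Blackwell",
    12: "Blackwell",
}


def tile_family(compute_capability):
    """Return the family the post lists for e.g. "8.6", or None if unlisted."""
    major = int(str(compute_capability).split(".")[0])
    return TILE_FAMILIES_PER_POST.get(major)


if __name__ == "__main__":
    for cc in ("8.6", "9.0", "10.0", "12.0"):
        print(cc, "->", tile_family(cc) or "not listed in the post")
```

A `None` result means only that the post does not mention that family; it says nothing about actual hardware support.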

22 hours ago 0 0 0 0

Package Installation
Install the cuda-tile PyPI package via pip install cuda-tile

22 hours ago 0 0 0 0

#Python-3.1-Integration
Use the high-level #cuda-tile-package to avoid manual nvcc calls.

22 hours ago 0 0 0 0

-arch / -gencode
Required if you are embedding tile kernels into a larger C++ application, to target specific compute capabilities.
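
As a rough illustration of what targeting specific compute capabilities looks like on an nvcc command line, the sketch below builds `-gencode arch=compute_XX,code=sm_XX` pairs from a list of capabilities. The helper name `gencode_flags` and the file names `app.cu` and `app.o` are hypothetical, and nothing is executed; the command is only printed.

```python
import shlex


def gencode_flags(compute_capabilities):
    """Build nvcc -gencode flags for capabilities such as "8.0" or "10.0"."""
    flags = []
    for cc in compute_capabilities:
        major, minor = cc.split(".")
        digits = f"{int(major)}{int(minor)}"  # "10.0" -> "100"
        flags += ["-gencode", f"arch=compute_{digits},code=sm_{digits}"]
    return flags


if __name__ == "__main__":
    # Hypothetical source and object file names, for illustration only.
    command = ["nvcc", *gencode_flags(["8.0", "10.0"]), "-c", "app.cu", "-o", "app.o"]
    print(shlex.join(command))
```

Each pair embeds machine code for one architecture, so listing several capabilities produces a binary that covers each of them.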

22 hours ago 0 0 0 0

#nvcc (Standard CUDA Compiler):

22 hours ago 0 0 0 0

--bytecode-version=13.2
Version for explicit type tag versioning, which is a new feature in CUDA 13.2.

22 hours ago 0 0 0 0

#cuda-tile-translate converts high-level MLIR representations to Tile IR bytecode.

22 hours ago 0 0 0 0

--gpu-name
Target architecture: #sm_100 for #Blackwell.

22 hours ago 0 0 0 0

#tileiras (Tile IR Assembler) compiles Tile IR bytecode into executable binaries (#cubins).
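
A minimal sketch of how such an ahead-of-time step might be scripted follows. The `--gpu-name` option and the `sm_100` value come from this feed; the `-o` output flag, the argument order, and the file names `kernel.tilebc` and `kernel.cubin` are assumptions for illustration, so check `tileiras --help` before relying on them. The command is only assembled and printed, not run.

```python
import shlex


def tileiras_command(bytecode_path, cubin_path, gpu_name="sm_100"):
    """Assemble (but do not run) a tileiras invocation as an argv list."""
    return [
        "tileiras",
        "--gpu-name", gpu_name,  # option named in this feed
        bytecode_path,           # Tile IR bytecode input (hypothetical name)
        "-o", cubin_path,        # assumption: output is selected with -o
    ]


if __name__ == "__main__":
    cmd = tileiras_command("kernel.tilebc", "kernel.cubin")
    print(shlex.join(cmd))
```

Keeping the command as a list makes it easy to hand to `subprocess.run` later without shell-quoting issues.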

22 hours ago 0 0 0 0

While traditional SIMT kernels use nvcc, CUDA Tile kernels often involve a new #specialized-toolchain.

22 hours ago 0 0 0 0

When working with the underlying tools or compiling #AOT (Ahead-of-Time), the following parameters and tools are essential:

22 hours ago 1 0 0 0
