Module advanced_gpu_profiler

Advanced GPU profiling and kernel optimization tools

This module provides GPU memory analysis (allocation tracking, fragmentation, bandwidth, and memory pressure), kernel optimization suggestions, and profiling support for CUDA, ROCm, and OpenCL kernels.

Structs

AccessLocalityMetrics
AdvancedGpuMemoryProfiler
    Advanced GPU memory profiler with fragmentation analysis
AdvancedGpuProfilingConfig
    Configuration for advanced GPU profiling
AllocationContext
AllocationHotSpot
AllocationPatternSummary
ArithmeticIntensityAnalyzer
ArithmeticIntensityProfile
BalancingStrategy
BandwidthSample
BandwidthSummary
BankConflictAnalyzer
BankConflictPattern
BottleneckFactor
CacheOptimization
CachePerformanceAnalysis
CoalescingAnalysis
CoalescingImprovement
ComputeBottleneckAnalysis
ComputeOptimizationOpportunity
ComputeUtilizationAnalyzer
    Compute utilization analysis
ComputeUtilizationProfile
ConfigPerformanceMeasurement
ConflictResolutionStrategy
CrossDeviceTransfer
    Cross-device memory transfer tracking
CrossDeviceTransferSummary
DetectedStride
ExpectedBenefit
ExpectedImprovement
FragmentationSummary
GpuBandwidthMonitor
    GPU bandwidth monitoring
GpuMemoryAllocation
    GPU memory allocation with detailed tracking
HighImpactOptimization
InstructionMixAnalysis
KernelExecutionProfile
KernelOptimization
KernelOptimizationSummaryReport
    Summary report for kernel optimization
LaunchConfigAnalyzer
    Launch configuration analysis
LaunchConfigSearchSpace
MemoryAccessAnalysis
MemoryAccessAnalyzer
    Memory access pattern analysis
MemoryAccessPattern
MemoryAnalysisReport
MemoryFragmentationSnapshot
    Memory fragmentation analysis
MemoryOptimizationRecommendation
MemoryPressureMonitor
    Memory pressure monitoring
MemoryPressureSnapshot
MemoryPressureSummary
MemoryPressureThresholds
MemoryUsageStats
OptimalLaunchConfig
ResourceBalancer
ResourceProfile
ResourceUtilizationMetrics
RooflineModel
StrideAnalysisResult
SustainedBandwidthMeasurement
TransferBottleneck
UncoalescedRegion
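
Several of the structs above (AdvancedGpuMemoryProfiler, MemoryFragmentationSnapshot, FragmentationSummary) concern fragmentation analysis. This page does not show their fields or methods, so the following is only a minimal sketch of one common external-fragmentation metric, not this module's implementation. The function name `external_fragmentation` and its slice-of-block-sizes input are hypothetical and chosen for illustration.

```rust
/// Hypothetical helper, NOT part of `advanced_gpu_profiler`.
///
/// Computes a common external-fragmentation score from the sizes (in bytes)
/// of the free blocks in a memory pool:
///
///     1 - (largest free block / total free bytes)
///
/// 0.0 means all free memory is one contiguous block; values approaching 1.0
/// mean the free memory is scattered across many small blocks.
pub fn external_fragmentation(free_blocks: &[u64]) -> f64 {
    let total: u64 = free_blocks.iter().sum();
    if total == 0 {
        // No free memory at all: nothing to fragment.
        return 0.0;
    }
    // `total > 0` implies the slice is non-empty, so `max()` returns `Some`.
    let largest = free_blocks.iter().copied().max().unwrap_or(0);
    1.0 - (largest as f64) / (total as f64)
}
```

Under this assumed metric, a pool whose free list is `[256, 256, 512]` bytes scores 0.5, because only half of the free memory can be served by a single allocation. A profiler could sample such a score over time to derive a trend, which is plausibly what an enum like FragmentationTrend represents, though that is an inference from the name alone.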

Enums

AllocationSource
CacheOptimizationType
CoalescingImprovementType
ComputeBottleneckType
ComputeOptimizationType
ConflictSeverity
CrossDeviceTransferType
FragmentationTrend
GpuMemoryType
ImplementationDifficulty
LaunchConstraint
LimitingFactor
MemoryOperationType
MemoryOptimizationType
MemoryPressureLevel
OptimizationPriority
OptimizationType
OptimizationValue
PressureTrend
ResolutionStrategyType
StridePattern
TransferBottleneckType
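
The index lists RooflineModel, ArithmeticIntensityAnalyzer, and LimitingFactor without showing their definitions. As a rough illustration of the standard roofline calculation that such types usually support, here is a self-contained sketch. All names in it (`RooflineSketch`, `Bound`, `arithmetic_intensity`) are hypothetical stand-ins and should not be read as this crate's API.

```rust
/// Hypothetical stand-in for illustration; NOT the crate's `RooflineModel`.
#[derive(Debug, Clone, Copy)]
pub struct RooflineSketch {
    /// Peak compute throughput of the device, in GFLOP/s.
    pub peak_gflops: f64,
    /// Peak memory bandwidth of the device, in GB/s.
    pub peak_bandwidth_gbs: f64,
}

/// Which ceiling limits a kernel (hypothetical; compare `LimitingFactor`).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Bound {
    Memory,
    Compute,
}

/// Arithmetic intensity in FLOP per byte of memory traffic.
pub fn arithmetic_intensity(flops: f64, bytes_moved: f64) -> f64 {
    flops / bytes_moved
}

impl RooflineSketch {
    /// Intensity (FLOP/byte) at which the bandwidth and compute ceilings meet.
    pub fn ridge_point(&self) -> f64 {
        self.peak_gflops / self.peak_bandwidth_gbs
    }

    /// Attainable throughput: min(peak compute, bandwidth * intensity).
    pub fn attainable_gflops(&self, intensity: f64) -> f64 {
        (self.peak_bandwidth_gbs * intensity).min(self.peak_gflops)
    }

    /// Kernels left of the ridge point are memory bound, the rest compute bound.
    pub fn classify(&self, intensity: f64) -> Bound {
        if intensity < self.ridge_point() {
            Bound::Memory
        } else {
            Bound::Compute
        }
    }
}
```

For a device with assumed peaks of 10,000 GFLOP/s and 1,000 GB/s, the ridge point is 10 FLOP/byte. A kernel at 2 FLOP/byte is memory bound and can reach at most 2,000 GFLOP/s, so reducing memory traffic (for example through better coalescing or caching) helps more than tuning arithmetic.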