Top Down Performance Profiling On Nvidias Gpus