CUDA 記憶體除錯：memcheck，racecheck 【讀書筆記】

阿新 • • 發佈：2018-12-18

Usage: cuda-memcheck [options] [your-program] [your-program-options]
Options:
 --binary-patching <yes|no>  [Default : yes]
                       Control the binary patching of the device code. This is enabled by default.
                       Disabling this option will result in a loss of precision for error reporting.
 --check-api-memory-access <yes|no> [Default : yes]
                       Check cudaMemcpy/cudaMemset for accesses to device memory
 --check-deprecated-instr <yes|no>  [Default : no]
                       Check for usage of deprecated instructions.
                       If deprecated instruction usage is found, an error will be reported.
                       Which instructions are checked might depend on the selected tool.
                       This is disabled by default.
 --check-device-heap <yes|no>  [Default : yes]
                       Check allocations on the device heap. This is enabled by default.
 --demangle <full|simple|no>  [Default : full]
                       Demangle function names
                       full   : Show full name and prototype
                       simple : Show only device kernel name
                       no     : Show mangled names
 --destroy-on-device-error <context|kernel>   [Default : context]
                       Behavior of cuda-memcheck on a precise device error.
                       NOTE: Imprecise errors  will always destroy the context.
                       context : CUDA Context is terminated with an error.
                       kernel  : Kernel is terminated. Subsequent kernel launches are still allowed.
 --error-exitcode <number> [Default : 0]
                       When this is set, memcheck will return the given exitcode when any errors are detected
 --filter key1=val1,key2=val2,...
                       The filter option can be used to control the kernels that will be checked by the tool
                       Multiple filter options can be defined. Each option is additive, so kernels matching
                       any specified filter will be checked
                       Filters are specified as key value pairs, with each pair separated by a ','
                       Keys have both a long form, and a shorter form for convenience
                       Valid values for keys are:
                           kernel_name, kne      : The value is the full demangled name of the kernel
                           kernel_substring, kns : The value is a substring present in the demangled name of the kernel
                       NOTE: The name and substring keys cannot be simultaneously specified
 --flush-to-disk <yes|no>   [Default : no]
                       Flush errors to disk. This can be enabled to ensure all errors are flushed down
 --force-blocking-launches <yes|no>   [Default : no]
                       Force launches to be blocking.
 -h | --help           Show this message.
 --help-debug          Show information about debug only flags
 --language <c|fortran> [Default : c]
                       This option can be used to enable language specific behavior. When set to fortan, the thread and block indices
                       of messages printed by cuda-memcheck will start with 1-based offset to match Fortran semantics.
 --log-file <string>   File where cuda-memcheck will write all of its text output. If not specified, memcheck output is written to stdout.
                       The sequence %p in the string name will be replaced by the pid of the cuda-memcheck application.
                       The sequence %q{FOO} will be replaced by the value of the environment variable FOO. If the environment variable
                       is not defined, it will be replaced by an empty string.
                       The sequence %% is replaced with a literal % in the file name.
                       Any other character following % will cause the entire string to be ignored.
                       If the file cannot be written to for any reason including an invalid path, insufficient permissions or disk being full
                       the output will go to stdout
 --leak-check <full|no> [Default : no]
                       Print leak information for CUDA allocations.
                       NOTE: Program must end with cudaDeviceReset() for this to work.
 --prefix <string>     Changes the prefix string displayed by cuda-memcheck.
 --print-level <info|warn|error|fatal> [Default : warn]
                       Set the minimum level of errors to print
 --print-limit <number> [Default is : 10000]
                       When this is set, memcheck will stop printing errors after reaching the given number
                       of errors. Use 0 for unlimited printing.
 --read <file>         Reads error records from a given file.
 --racecheck-report <all|hazard|analysis>  [Default : analysis]
                       The reporting mode that applies to racecheck.
                       all      : Report all hazards and race analysis reports.
                       hazard   : Report only hazards.
                       analysis : Report only race analysis results.
 --report-api-errors <all|explicit|no> [Default : explicit]
                       Print errors if any API call fails
                       all      : Report all CUDA API errors, including those APIs invoked implicitly
                       explicit : Report errors in explicit CUDA API calls only
                       no       : Disable reporting of CUDA API errors
 --save <file>         Saves the error record to file.
                       The sequence %p in the string name will be replaced by the pid of the cuda-memcheck application.
                       The sequence %q{FOO} will be replaced by the value of the environment variable FOO. If the environment variable
                       is not defined, it will be replaced by an empty string.
                       The sequence %% is replaced with a literal % in the file name.
                       Any other character following % will cause an error.
 --show-backtrace <yes|host|device|no> [Default : yes]
                       Display a backtrace on error.
                       no     : No backtrace shown
                       host   : Only host backtrace shown
                       device : Only device backtrace shown for precise errors
                       yes    : Host and device backtraces shown
                       See the manual for more information
 --tool <memcheck|racecheck|synccheck|initcheck>  [Default : memcheck]
                       Set the tool to use.
                       memcheck    : Memory access checking
                       racecheck   : Shared memory hazard checking
                       Note : This disables memcheck, so make sure the app is error free.
                       synccheck   : Synchronization checking
                       initcheck   : Global memory initialization checking
 --track-unused-memory <yes|no> [Default : no]
                       Check for unused memory allocations. This requires initcheck tool.
 -V | --version        Print the version of cuda-memcheck.

Please see the cuda-memcheck manual for more information.

CUDA 記憶體除錯：memcheck，racecheck 【讀書筆記】

Usage: cuda-memcheck [options] [your-program] [your-program-options] Options: --binary-patching <yes|no> [Default : yes]

CUDA C 最佳實踐：應用程式效能分析【讀書筆記】

以下為長截圖，CSDN 限定了圖片長度，請點選檢視原圖 gprof： gprof 支援的選項： -b 不再輸出統計圖表中每個欄位的詳細描述。 -q 只輸出函式的呼叫圖（Call graph的那部分資訊）。 -p 只輸出函式的時間

【讀書筆記】計算機網絡1章：課程介紹、協議、分層

視頻打印 http dns 物理層 size cli 電子商務 ann 改變這是我在Coursera上的學習筆記。課程名稱為《Computer Networks》。出自University of Washington。因為計算機網絡才誕生不久

【讀書筆記】：MIT線性代數(1):Linear Combinations

http info cti pla imp column ase fin generate 1. Linear Combination Two linear operations of vectors: Linear combination: 2.Geometric

【讀書筆記】：MIT線性代數(4):Independence, Basis and Dimension

bsp variables inf ane image ace play mit variable Independence: The columns of A are independent when the nullspace N (A) contains only t

【讀書筆記】《csapp》第一章：計算機系統漫遊

第一章計算機系統漫遊這是跟著劉欣大佬讀csapp的作業。這裡不是純粹一本書的讀書筆記，只是摘錄了我感興趣的部分，結合之前的書和感想。儘量讓這個作業有那麼點意義。幾個重要概念 1.抽象還記得大

【讀書筆記】JAVA基礎：1、深入理解JVM

通過《深入理解JAVA虛擬機器》和《深入理解計算機系統》兩本經典著作的學習，注重瞭解系統程序執行時記憶體結構的變化，以此徹底瞭解JVM虛擬機器在執行JAVA程式時的記憶體結構！主要有三個方面： &nb

【讀書筆記】WEB應用：1、日誌配置

log4j.properties 使用一.引數意義說明輸出級別的種類 ERROR、WARN、INFO、DEBUG ERROR 為嚴重錯誤主要是程式的錯誤 WARN 為一般警告，比如session丟失 INFO 為一般要顯示的資訊，比如登入登出 DEBUG 為程式的除錯資訊

【讀書筆記】人人都是架構師：分散式系統架構落地與瓶頸突破

《人人都是架構師：分散式系統架構落地與瓶頸突破》。書主要介紹作者遇到的一些實際場景，提供了處理一些典型場景的思路，書中介紹了許多開源軟體，但程式碼和一些細節比較少。分散式入門書，開拓了視野。大流量消鋒/限流的常規手段 1. 擴容使用叢集技術對伺

【讀書筆記】iOS-截屏功能的實現。

ima under auto core cal ica dsm gef control 一。整個project文件。二，代碼 ViewController.m #import "ViewController.h" #import <Q

【讀書筆記】——終極算法

終極進行生物 nbsp 人工研究院支持向量機來源統計 Note1:網飛的推薦傾向於長尾 Note2: 符號學派：逆向演繹，從哲學、心理學、邏輯學尋求洞見——>逆向演繹連接學派：對大腦進行逆向分析，來源於神經科學和物理學——>反向傳播進化學派：在計

【讀書筆記】iOS-查看一個軟件ipa包的內容

技術 -s alt dsm clas rda 軟件選中 tun 一，打開itunes----->我的iPhone應用程序。二，右鍵點擊app---->在Finder中顯示---->出現下圖所看到的界面。

【讀書筆記】設計心理學2-如何管理復雜【一】

然而困難虛擬前行方式間接行為這就是找到最近在看一些書籍，感覺不寫一些筆記，效果不是特別明顯。出於這個目的，於是有了下面的讀書筆記文章。從《設計心理學2-如何管理復雜》開始寫吧。在看這本書之前，其實自己覺得各種事情只要肯學習，其實都是挺簡單的。但看了本書

【讀書筆記】技術每天一點點--2017.08月

files .html pop 演進 lis 我們 ati 檢測讀書筆記本文地址：http://www.cnblogs.com/aiweixiao/p/7451352.html 本文提綱：概述每天進展 1.【遺留問題】　　1.1）【問

【讀書筆記】閱讀的危險

enter 忘記而是有趣人在很多新的 tex 下一個閱讀的危險　　我脫離我的極簡主義哲學最大的原因之一就是閱讀，確切地說，是閱讀他人在做什麽。我閱讀博文或者雜誌上的文章，上面寫了別人所做的一些有趣的事情：旅行，使用一種新型高效的系統，烤面包等。然後我也想去做那

【讀書筆記】計算機是如何跑起來的

tab 循環隊列 mac 消息傳遞 tracer 私鑰表示記錄一下書中每章我認為的要點。前言作者在前言闡述了一個道理，計算機基礎知識的牢固是深入學習和興趣來源的所在。劃分一個知識範圍-》基礎中的基礎的知識-》設定目標，這些知識可以做什麽第

【讀書筆記】沈默的大多數

style 都是證明幸福如果沒有個人 pan 由於　　人從來都不能從獲得某件令人幸福的物品而獲得幸福，獲得幸福一定是因為某個人做了令他感到幸福的事情。--羅素　　我不敢完全肯定這句話，因為我不能證明它的反面是錯的。正確的前提能推出正確的結論，而錯誤的前提什麽都

【讀書筆記】《Maven實戰》第7章生命周期與插件

命令 ide ner 資源 clas res content 獨立 default 7.1什麽是生命周期軟件開發人員每天都在對項目進行清理、編譯、測試及部署，Maven生命周期是對所有構建過程進行抽象和統一，含項目的清理、初始化、編譯、測試、打包、集成測試、驗證、部署

【讀書筆記】《Effective Java》——創建和銷毀對象

auth static 直接 cdr 也會 pattern cal next false Item 1. 考慮用靜態工廠方法替代構造器獲得一個類的實例時我們都會采取一個公有的構造器。Foo x = new Foo()；同時我們應該掌握另一種方法就是靜態工廠方法（st

【讀書筆記】The Swift Programming Language (Swift 4.0.3)

code any 是個重建之一 eric esc 傳值特定素材：Language Guide 初次接觸 Swift，建議先看下 A Swift Tour,否則思維轉換會很費力，容易卡死或鉆牛角尖。同樣是每一章只總結3個自己認為最重要的點。這樣挺好!強迫你去思考去取

CUDA 記憶體除錯：memcheck，racecheck 【讀書筆記】

相關推薦