qh203
[Help] Parallel computation problem: root vs. regular user
As root, I ran the cpi example in parallel with Open MPI on 6 nodes with 8 CPUs each. The output is normal:

[root@node1 examples]# mpirun -np 40 -machinefile test ./cpi
Process 3 on node2
Process 38 on node6
Process 18 on node4
Process 32 on node6
Process 20 on node4
Process 2 on node2
Process 35 on node6
Process 34 on node6
Process 22 on node4
Process 7 on node2
Process 23 on node4
Process 5 on node2
Process 4 on node2
Process 37 on node6
Process 33 on node6
Process 30 on node5
Process 8 on node3
Process 26 on node5
Process 10 on node3
Process 15 on node3
Process 27 on node5
Process 31 on node5
Process 28 on node5
Process 24 on node5
Process 19 on node4
Process 21 on node4
Process 17 on node4
Process 6 on node2
Process 16 on node4
Process 25 on node5
Process 9 on node3
Process 11 on node3
Process 13 on node3
Process 14 on node3
Process 0 on node2
Process 1 on node2
Process 36 on node6
Process 39 on node6
Process 12 on node3
Process 29 on node5
pi is approximately 3.1416009869231245, Error is 0.0000083333333314
wall clock time = 0.128546

As a regular user, the same cpi run with Open MPI produces this instead:

[aojjj@node1 examples]$ mpirun -np 40 -machinefile test ./cpi
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
--------------------------------------------------------------------------
The OpenFabrics (openib) BTL failed to register memory in the driver.
Please check /var/log/messages or dmesg for driver specific failure reason.
The failure occured here:
Local host: mthca0
Device: openib_reg_mr
Function: Cannot allocate memory()
Errno says:
You may need to consult with your system administrator to get this problem fixed.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
The OpenFabrics (openib) BTL failed to initialize while trying to allocate some locked memory.
This typically can indicate that the memlock limits are set too low.
For most HPC installations, the memlock limits should be set to "unlimited".
The failure occured here:
Local host: node4
OMPI source: btl_openib_component.c:1161
Function: ompi_free_list_init_ex_new()
Device: mthca0
Memlock limit: 32768
You may need to consult with your system administrator to get this problem fixed.
This FAQ entry on the Open MPI web site may also be helpful:
http://www.open-mpi.org/faq/?category=openfabrics#ib-locked-pages
--------------------------------------------------------------------------
--------------------------------------------------------------------------
WARNING: There was an error initializing an OpenFabrics device.
Local host: node4
Local device: mthca0
--------------------------------------------------------------------------
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
[the same libibverbs warning repeats dozens more times]
Process 26 on node5
Process 8 on node3
Process 28 on node5
Process 1 on node2
Process 29 on node5
Process 4 on node2
Process 22 on node4
Process 2 on node2
Process 15 on node3
Process 25 on node5
Process 31 on node5
Process 38 on node6
Process 14 on node3
Process 30 on node5
Process 32 on node6
Process 39 on node6
Process 37 on node6
Process 33 on node6
Process 36 on node6
Process 35 on node6
Process 16 on node4
Process 18 on node4
Process 10 on node3
Process 21 on node4
Process 19 on node4
Process 20 on node4
Process 11 on node3
Process 17 on node4
Process 9 on node3
Process 0 on node2
Process 7 on node2
Process 6 on node2
Process 5 on node2
Process 23 on node4
Process 24 on node5
Process 3 on node2
Process 27 on node5
Process 34 on node6
Process 12 on node3
Process 13 on node3
pi is approximately 3.1416009869231245, Error is 0.0000083333333314
wall clock time = 3.002147
[node1:02112] 39 more processes have sent help message help-mpi-btl-openib.txt / mem-reg-fail
[node1:02112] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages
[node1:02112] 36 more processes have sent help message help-mpi-btl-openib.txt / init-fail-no-mem
[node1:02112] 39 more processes have sent help message help-mpi-btl-openib.txt / error in device init

The result is still computed, but with many extra warnings and errors, and the wall clock time goes from about 0.13 s to about 3.0 s. I have edited /etc/security/limits.conf and /etc/init.d/sshd on every node, but it still does not work. Where exactly is the problem?
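One way to narrow this down might be to check which locked-memory limit a process actually sees, since the value in an interactive login shell can differ from the one a remotely launched process inherits. The following is only a minimal sketch, assuming a POSIX shell; `memlock_status` is a hypothetical helper name and not part of Open MPI. Note that `ulimit -l` reports KiB, so the 32768 bytes in the warnings would show up as 32.

```shell
#!/bin/sh
# Classify a value as printed by `ulimit -l` (KiB, or the word "unlimited").
# memlock_status is a hypothetical helper name used only for this sketch.
memlock_status() {
    case "$1" in
        unlimited)   echo "ok" ;;        # what the Open MPI message asks for
        ''|*[!0-9]*) echo "unknown" ;;   # empty or not a plain number
        *)           echo "limited" ;;   # any finite limit, e.g. 32
    esac
}

# Report the limit seen by the current shell on this host.
limit=$(ulimit -l 2>/dev/null || echo unknown)
echo "$(uname -n): memlock=$limit ($(memlock_status "$limit"))"
```

Because mpirun can launch ordinary programs as well as MPI binaries, the same check could be run as the regular user across the machinefile, for example `mpirun -np 40 -machinefile test sh -c 'echo "$(uname -n): $(ulimit -l)"'`. If any node still prints 32 rather than unlimited, the edited limits are presumably not reaching the processes started there.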