| 5 | 1/1 | 返回列表 |
| 查看: 3321 | 回復: 5 | ||
| 當前只顯示滿足指定條件的回帖,點擊這里查看本話題的所有回帖 | ||
04nylxb木蟲 (正式寫手)
|
[求助]
vasp跨節(jié)點運行出錯,mpiexec_node-1 (handle_stdin_input 1089)
|
|
|
最近在集群上編譯帶CNEB的vasp5.2,并行vasp編譯成功,在單個節(jié)點(每個節(jié)點八核)上運行 $ mpirun -np 8 vasp 時候,top下,發(fā)現(xiàn)確實出現(xiàn)八個vasp進程。 但是,跨節(jié)點的時候,確出錯了,出錯信息如下: running on 15 nodes distr: one band on 1 nodes, 15 groups vasp.5.2.12 11Nov11 complex POSCAR found : 1 types and 2 ions ----------------------------------------------------------------------------- | | | W W AA RRRRR N N II N N GGGG !!! | | W W A A R R NN N II NN N G G !!! | | W W A A R R N N N II N N N G !!! | | W WW W AAAAAA RRRRR N N N II N N N G GGG ! | | WW WW A A R R N NN II N NN G G | | W W A A R R N N II N N GGGG !!! | | | | For optimal performance we recommend that you set | | NPAR = approx SQRT( number of cores) | | This will greatly improve the performance of VASP for DFT. | | The default NPAR=number of cores might be grossly inefficient | | on modern multi-core architectures or massively parallel machines. | | Unfortunately you need to use the default for hybrid, GW and RPA | | calculations. | | | ----------------------------------------------------------------------------- LDA part: xc-table for Pade appr. of Perdew found WAVECAR, reading the header number of bands has changed, file: 12 present: 15 trying to continue reading WAVECAR, but it might fail POSCAR, INCAR and KPOINTS ok, starting setup WARNING: small aliasing (wrap around) errors must be expected FFT: planning ...( 1 ) reading WAVECAR random initialization beyond band 13 the WAVECAR file was read sucessfully initial charge from wavefunction entering main loop N E dE d eps ncg rms rms(c) mpiexec_node-1 (handle_stdin_input 1089): stdin problem; if pgm is run in background, redirect from /dev/null mpiexec_node-1 (handle_stdin_input 1090): e.g.: mpiexec -n 4 a.out < /dev/null & rank 14 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 14: killed by signal 11 rank 13 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 13: killed by signal 9 rank 9 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 9: killed by signal 11 rank 8 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 8: killed by signal 11 rank 4 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 4: killed by signal 11 rank 3 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 3: killed by signal 9 rank 2 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 2: killed by signal 9 rank 1 in job 14 node-1_49061 caused collective abort of all ranks exit status of rank 1: killed by signal 11 rank 0 in job 14 node-1_49061 caused collective abort of all ranks 其中node-1是我的控制節(jié)點。進程數(shù)為12以下的時候都運行正常 $ mpirun -machinefile ~/machinefile -np 12 vasp > 5out 其中,mpich2,我用cpi測試,各個節(jié)點都OK的,并且能夠跑上百個核。 求高人指點,為什么vasp跨節(jié)點的時候出現(xiàn)這樣的錯誤?該如何解決?非常感謝啊。 另,想問下,編譯的時候,make makeparam,生成的這個makeparam是干嘛用的? |

榮譽版主 (職業(yè)作家)
木蟲 (正式寫手)

榮譽版主 (著名寫手)
木蟲 (正式寫手)
|
非常感謝。 嗯,NPAR我都設成了并行的核數(shù)了,感覺這個節(jié)點數(shù)無法估計啊,有時候任務調度系統(tǒng)分配給4個節(jié)點,有時候分配給10個節(jié)點。是否不需要嚴格的節(jié)點數(shù)?按照它說的近似corse的開方即可? mpi方面,我用的是mpich2,我用Mpi自帶的examples下面的cpi測試,發(fā)現(xiàn)并行都是順利完成,指定幾個節(jié)點,輸出里面會有相應的節(jié)點運行報告,是否可以說mpi安裝是好的? 我昨天測試運行的時候還發(fā)現(xiàn)一個問題,有時候去提交任務,-np 64之類的,任務正常,各個節(jié)點都會分配vasp任務,然后過了一兩個小時之后,再次運行同樣的任務,vasp又出現(xiàn)上面的錯誤了,汗,郁悶啊。 |

| 最具人氣熱帖推薦 [查看全部] | 作者 | 回/看 | 最后發(fā)表 | |
|---|---|---|---|---|
|
[考研] 306求0703調劑一志愿華中師范 +6 | 紙魚ly 2026-03-21 | 6/300 |
|
|---|---|---|---|---|
|
[考研]
|
pk3725069 2026-03-19 | 13/650 |
|
|
[考研] 070300化學求調劑 +5 | 苑豆豆 2026-03-20 | 5/250 |
|
|
[考研] 一志愿北京化工大學 070300 學碩 336分 求調劑 +5 | vv迷 2026-03-22 | 5/250 |
|
|
[考研] 307求調劑 +11 | 冷笙123 2026-03-17 | 11/550 |
|
|
[考研] 求調劑一志愿海大,0703化學學碩304分,有大創(chuàng)項目,四級已過 +6 | 幸運哩哩 2026-03-22 | 10/500 |
|
|
[考研] 一志愿華中農業(yè)071010,總分320求調劑 +5 | 困困困困坤坤 2026-03-20 | 6/300 |
|
|
[考研] 260求調劑 +3 | 朱芷琳 2026-03-20 | 4/200 |
|
|
[基金申請] 山東省面上項目限額評審 +4 | 石瑞0426 2026-03-19 | 4/200 |
|
|
[考研] 考研調劑 +3 | 呼呼?~+123456 2026-03-21 | 3/150 |
|
|
[考研] 求調劑 +4 | 要好好無聊 2026-03-21 | 4/200 |
|
|
[考研] 302求調劑 +12 | 呼呼呼。。。。 2026-03-17 | 12/600 |
|
|
[考研] 一志愿天津大學化學工藝專業(yè)(081702)315分求調劑 +12 | yangfz 2026-03-17 | 12/600 |
|
|
[考研] 307求調劑 +3 | wyyyqx 2026-03-17 | 3/150 |
|
|
[考研] 301求調劑 +10 | yy要上岸呀 2026-03-17 | 10/500 |
|
|
[考研] 332求調劑 +4 | ydfyh 2026-03-17 | 4/200 |
|
|
[考研] 一志愿武理材料305分求調劑 +6 | 想上岸的鯉魚 2026-03-18 | 7/350 |
|
|
[考研] 一志愿西南交大,求調劑 +5 | 材化逐夢人 2026-03-18 | 5/250 |
|
|
[考研] 288求調劑 +16 | 于海海海海 2026-03-19 | 16/800 |
|
|
[考研] 材料考研調劑 +3 | xwt。 2026-03-19 | 3/150 |
|