爱气象,爱气象家园! 

气象家园

 找回密码
 立即注册

QQ登录

只需一步,快速开始

新浪微博登陆

只需一步, 快速开始

搜索
查看: 102|回复: 0

关于WRF-CHEM作业中断的问题= =

[复制链接]
回帖奖励 32 金钱 回复本帖可获得 4 金钱奖励! 每人限 1 次

新浪微博达人勋

发表于 2024-12-11 00:20:02 | 显示全部楼层 |阅读模式

登录后查看更多精彩内容~

您需要 登录 才可以下载或查看,没有帐号?立即注册 新浪微博登陆

x
本帖最后由 叫我小岳 于 2024-12-11 00:21 编辑

通过pbs脚本设置:#PBS -l nodes=6:ppn=15
提交作业至服务器,发现作业总是在运行至00时22分45秒就会中断,查询rsl.error.*日志并没有报错
查询stdout.log发现存在以下报错:
This job allocated 90 procs
--------------------------------------------------------------------------
mpirun has exited due to process rank 60 with PID 0 on
node c05 exiting improperly. There are three reasons this could occur:

1. this process did not call "init" before exiting, but others in
the job did. This can cause a job to hang indefinitely while it waits
for all processes to call "init". By rule, if one process calls "init",
then ALL processes must call "init" prior to termination.

2. this process called "init", but exited without calling "finalize".
By rule, all processes that call "init" MUST call "finalize" prior to
exiting or it will be considered an "abnormal termination"

3. this process called "MPI_Abort" or "orte_abort" and the mca parameter
orte_create_session_dirs is set to false. In this case, the run-time cannot
detect that the abort call was an abnormal termination. Hence, the only
error message you will receive is this one.

This may have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).

You can avoid this message by specifying -quiet on the mpirun command line.

--------------------------------------------------------------------------
并且调整nodes&ppn的设置大一点也是这样的报错
各位大佬有遇到过这个问题的嘛?
密码修改失败请联系微信:mofangbao
您需要登录后才可以回帖 登录 | 立即注册 新浪微博登陆

本版积分规则

Copyright ©2011-2014 bbs.06climate.com All Rights Reserved.  Powered by Discuz! (京ICP-10201084)

本站信息均由会员发表,不代表气象家园立场,禁止在本站发表与国家法律相抵触言论

快速回复 返回顶部 返回列表