CESM job submission fails

Posted on 2015-4-26 17:12:55

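After my CESM case built successfully, I submitted it on the cluster, but the job died almost as soon as it started running.
The log_cesm.log file contains: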
--------------------------------------------------------------------------
mpirun was unable to launch the specified application as it could not access
or execute an executable:
Executable: ./ccsm.exe
Node: c03n08
while attempting to start process rank 0.
--------------------------------------------------------------------------
36 total processes failed to start
My run.pbs file:
#!/bin/bash
############################
# WH, NUIST, 2015, Apr., for running
############################
### Redirect standard output and error: ############################################################
#
#
# remove previous log and err
rm -f `basename ${0%.*}`.log `basename ${0%.*}`.err
rm -f PBSSCRIPT.pbs
rm -f CESM.e*
rm -f CESM.o*
#
exec 1>>`basename ${0%.*}`.log
exec 2>>`basename ${0%.*}`.err
#
set -ex                            # Abort execution if an error occurs, do detailed logging
#
### Run parameters #################################################################################
#                    # Experiment identifier
WORKDIR=`pwd`
#
#
#
NODES=3
NC=12
NCPUS=`expr $NODES \* $NC`                 # number of CPUs for mpirun
#
# BUILD QSUB
cat > PBSSCRIPT.pbs << EOF
#!/bin/bash
#PBS -S /bin/bash
#PBS -N Niuniu
#PBS -P P001721001
#PBS -q Regular
#PBS -l select=$NODES:ncpus=$NC
#PBS -l walltime=12:00:00
#PBS -l place=free:excl
cd $WORKDIR
echo "Time to start:--------------"
date

mpirun -np $NCPUS ./ccsm.exe >& log_ccsm.log

echo "The Job Is Done! At:---------------"
date
EOF
qsub PBSSCRIPT.pbs
Does anyone know what the problem might be?
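The mpirun message in the log means that ./ccsm.exe could not be found, or could not be executed, from the directory the job changed into with cd $WORKDIR. A minimal sketch of a check that could run before qsub is shown here. It assumes the executable is expected directly inside that directory; check_exe is a hypothetical helper written for this thread, not part of CESM, PBS, or Open MPI.

```shell
#!/bin/bash
# Hypothetical helper (not from the original post): confirm that a file
# exists and is executable in a given directory before handing it to mpirun.
# Return codes: 0 = ok, 1 = directory missing, 2 = file missing, 3 = not executable.
check_exe() {
    local dir="$1" exe="$2"
    if [ ! -d "$dir" ]; then
        echo "missing directory: $dir"
        return 1
    fi
    if [ ! -e "$dir/$exe" ]; then
        echo "not found: $dir/$exe"
        return 2
    fi
    if [ ! -x "$dir/$exe" ]; then
        echo "not executable: $dir/$exe"
        return 3
    fi
    echo "ok: $dir/$exe"
}
```

In run.pbs this could be called as check_exe "$WORKDIR" ccsm.exe just before the qsub line. Because the script uses set -e, a non-zero return would stop it before the job is submitted.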

Original poster | Posted on 2015-4-26 17:53:26
Solved: it was a path problem.
But now a new problem has appeared:
(seq_io_init) pio init parameters: before nml read
(seq_io_init)    pio_stride   =    -99
(seq_io_init)    pio_root     =    -99
(seq_io_init)    pio_typename = nothing
(seq_io_init)    pio_numtasks =    -99
(seq_io_init)    pio_debug_level =      0
    pio_async_interface =   F
PGFIO-F-209/OPEN/unit=99/'OLD' specified for file which does not exist.
File name = drv_in
In source file /nuist/p/work/WH2015/CESM/cesm1_0_3/models/drv/shr/seq_io_mod.F90, at line number 283
--------------------------------------------------------------------------
mpirun has exited due to process rank 0 with PID 15600 on
node c19n07 exiting improperly. There are two reasons this could occur:

1. this process did not call "init" before exiting, but others in
the job did. This can cause a job to hang indefinitely while it waits
for all processes to call "init". By rule, if one process calls "init",
then ALL processes must call "init" prior to termination.

2. this process called "init", but exited without calling "finalize".
By rule, all processes that call "init" MUST call "finalize" prior to
exiting or it will be considered an "abnormal termination"

This may have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
What is causing this?
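The PGFIO-F-209 message says that the driver tried to open drv_in as an existing file, by a relative name, and did not find it in the directory where ccsm.exe was started. The following is a sketch of a check that could run before mpirun. It assumes the executable and its namelist files are expected in the same directory. missing_inputs is a hypothetical helper, and the file names passed to it are only the ones that appear in this thread, not a complete list of what CESM needs.

```shell
#!/bin/bash
# Hypothetical helper (not from the original post): list which of the
# given files are absent from a directory.
# Returns 0 if every file is present, 1 if at least one is missing.
missing_inputs() {
    local dir="$1"
    shift
    local f missing=0
    for f in "$@"; do
        if [ ! -f "$dir/$f" ]; then
            echo "missing: $dir/$f"
            missing=1
        fi
    done
    return "$missing"
}
```

For example, missing_inputs "$WORKDIR" ccsm.exe drv_in would print one line per absent file. If drv_in is reported missing, the job is probably being launched from a directory other than the one where the case's namelist files were written.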