A Further Understanding of Jython's Support for Chinese

MeteoInfo · Posted on 2014-11-29 23:39:48


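After talking with members of the Jython development team, I now have a better understanding of how Jython handles Unicode, and my earlier belief that newer Jython versions do not support Chinese turned out to be wrong. Starting with Jython 2.5, a str is treated as an array of bytes, which is very different from Jython 2.2; the change was made to stay consistent with CPython. Jeff Allen put it this way in an email: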

Up to (I think) v2.2, Jython proudly supported Unicode in its str type, in which the elements were 16-bit characters. After that, the Python language introduced a proper unicode type and from 2.5 we had to treat str as bytes. It follows that the object printed in your example would have to be a unicode literal u"xy", to have any chance of working. This is the same as in CPython.
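The difference between a byte string and a unicode string described in the quote can be seen in a small sketch. It is only an illustration, not something from the email: the variable names are made up here, and it assumes the source file is saved as UTF-8 and actually decoded as UTF-8.

```python
# coding=utf-8
from __future__ import print_function

text = u'中文'               # unicode string: one element per character
raw = text.encode('utf-8')   # byte string: one element per byte

print(len(text))                     # 2 characters
print(len(raw))                      # 6 bytes (3 bytes per character in UTF-8)
print(raw.decode('utf-8') == text)   # True: decoding the bytes restores the text
```

The same two characters count as 2 or as 6 depending on which type holds them, which is why the literal has to be written as u"..." once str means bytes.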

With his hint, and after reading the Jython source code, I worked out how to use Chinese correctly in newer Jython versions (2.5 and 2.7).

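1. Because I embed Jython scripts in a Java program, using Jython's PythonInterpreter and InteractiveConsole classes (the latter inherits from PythonInterpreter), cflags.source_is_utf8 of the PythonInterpreter has to be set to true. The flag defaults to false, in which case strings are converted to byte arrays using the "ISO-8859-1" encoding, and that encoding does not support Chinese.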

2. The default encoding of Jython's SystemState also has to be set to "utf-8": Py.getSystemState().setdefaultencoding("utf-8");

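3. Add a statement on the first line of the script that declares the encoding used by the script file:
# coding=utf-8
Note that there must be no spaces around the equals sign.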

4. Put the letter u in front of every string that contains Chinese characters, for example:
a = u'中文'
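Step 1 rests on the fact that ISO-8859-1 has no way to represent Chinese characters. The sketch below shows this from the script side; can_encode is a hypothetical helper written for this illustration and is not part of Jython.

```python
# coding=utf-8
from __future__ import print_function


def can_encode(text, encoding):
    """Return True if every character of text is representable in encoding."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


sample = u'中文'
print(can_encode(sample, 'iso-8859-1'))  # False: Latin-1 only covers U+0000..U+00FF
print(can_encode(sample, 'utf-8'))       # True: UTF-8 covers all of Unicode
```

This is why a pipeline that passes source text through ISO-8859-1 cannot carry Chinese intact, while one configured for UTF-8 can.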

A simple test script:
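A minimal sketch of what such a test could look like follows; it is an illustration written for this purpose, not the script from the original post. It combines steps 3 and 4 and checks the code points of the literal, since a source file decoded with the wrong encoding would yield more than two characters. The helper name describe is hypothetical.

```python
# coding=utf-8
from __future__ import print_function

a = u'中文'


def describe(text):
    """Return the length of a unicode string and its code points in hex."""
    return len(text), [hex(ord(ch)) for ch in text]


length, code_points = describe(a)
print(length)        # 2 when the source was decoded as UTF-8
print(code_points)   # ['0x4e2d', '0x6587'] when the source was decoded as UTF-8
```

If the interpreter is set up as described in steps 1 and 2, adding print(a) should also display the two characters on a console that uses UTF-8.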

With the changes above, MeteoInfo Java 1.1.7R1 was released (there have been quite a few releases lately, haha). It uses the latest Jython version, 2.7b3.

rceclx · Posted on 2014-11-30 00:02:37

Learning from Professor Wang!

Good night!

兰溪之水 · Posted on 2014-11-30 09:12:02

wysyhd · Posted on 2014-11-30 09:58:54

The professor is amazing... thumbs up!

fuyuanxuesheng · Posted on 2014-12-2 17:32:56

Nice, bumping this.

正能量 · Posted on 2017-12-30 16:21:47


liuxiunuli · Posted on 2018-1-2 00:06:00

Learning from Professor Wang!

