R语言 tm包 preprocessReut21578XML()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-10-1 10:51:25

preprocessReut21578XML(tm)
preprocessReut21578XML()所属R语言包：tm

                                    Preprocess the Reuters-21578 XML archive.
                                       预处理路透-21578 XML存档。

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Preprocess the Reuters-21578 <acronym>XML</acronym> archive by correcting invalid UTF-8 encodings and copying each text document into a separate file.
预处理路透“21578 <acronym>XML</首字母缩写纠正无效的UTF-8编码，每个文本文件复制到一个单独的文件归档。

用法----------Usage----------

preprocessReut21578XML(input, output, fixEnc = TRUE)

参数----------Arguments----------

参数：input
A character describing the input directory.
一个字符描述输入目录。

参数：output
A character describing the output directory.
一个字符描述输出目录。

参数：fixEnc
A logical value indicating whether an invalid UTF-8 encoding in the Reuters-21578 <acronym>XML</acronym> dataset should be corrected.
一个逻辑值，该值指示是否无效的UTF-8编码在路透社-21578 <acronym> XML </首字母缩写数据集应该被纠正。

值----------Value----------

No explicit return value. As a side product the directory output contains the corrected dataset.
没有明确的返回值。作为副产物的目录output包含校正后的数据集。

（作者）----------Author(s)----------

Ingo Feinerer

参考文献----------References----------

Collection Distribution 1.0. http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.html
http://modnlp.berlios.de/reuters21578.html
转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册

R语言 tm包 preprocessReut21578XML()函数中文帮助文档(中英文对照)

浏览过的版块