<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: About me</title>
	<atom:link href="http://www.xterm.it/blog/about/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.xterm.it/blog</link>
	<description>time to think -  by Marco Campana</description>
	<lastBuildDate>Mon, 05 Jul 2010 17:00:04 +0200</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: marco</title>
		<link>http://www.xterm.it/blog/about/comment-page-1/#comment-57</link>
		<dc:creator>marco</dc:creator>
		<pubDate>Tue, 08 Sep 2009 21:06:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.xterm.it/blog/?page_id=2#comment-57</guid>
		<description>Hi Gracy!

If you want to use the summarizer plugin for posts written in Chinese, you will have to do some programming. As described in the plugin documentation, you have to implement:

1) the tokenize() method, which is responsible for dividing the document into sentences.
2) the normalize() method, which is responsible for removing inflections from words (such as gender, number, person and so on). It&#039;s very likely you can find a stemming algorithm if you google it.

The best thing to do is to have a look at the existing methods for the English language to understand how they work, and then apply Chinese language rules to implement the two new methods.
Hope it helps, enjoy!</description>
		<content:encoded><![CDATA[<p>Hi Gracy!</p>
<p>If you want to use the summarizer plugin for posts written in Chinese, you will have to do some programming. As described in the plugin documentation, you have to implement:</p>
<p>1) the tokenize() method, which is responsible for dividing the document into sentences.<br />
2) the normalize() method, which is responsible for removing inflections from words (such as gender, number, person and so on). It&#8217;s very likely you can find a stemming algorithm if you google it.</p>
<p>The best thing to do is to have a look at the existing methods for the English language to understand how they work, and then apply Chinese language rules to implement the two new methods.<br />
Hope it helps, enjoy!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gracy</title>
		<link>http://www.xterm.it/blog/about/comment-page-1/#comment-51</link>
		<dc:creator>Gracy</dc:creator>
		<pubDate>Sat, 05 Sep 2009 17:12:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.xterm.it/blog/?page_id=2#comment-51</guid>
		<description>Hi, for the Post summarizer plugin, I&#039;d like to use Chinese for the plugin and have already created a &quot;ZH&quot; subdirectory. But I don&#039;t understand what you mean by:

2. Create a class that extends the abstract class Document in lib/
3. Implement the tokenize() and normalize() methods.
4. Add config information to the config.php file

Can you help me?</description>
		<content:encoded><![CDATA[<p>Hi, for the Post summarizer plugin, I&#8217;d like to use Chinese for the plugin and have already created a &#8220;ZH&#8221; subdirectory. But I don&#8217;t understand what you mean by:</p>
<p>2. Create a class that extends the abstract class Document in lib/<br />
3. Implement the tokenize() and normalize() methods.<br />
4. Add config information to the config.php file</p>
<p>Can you help me?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
