<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for The GITS Blog</title>
	<atom:link href="http://ginstrom.com/scribbles/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://ginstrom.com/scribbles</link>
	<description>Random scribbling about programming, translation, and Japan</description>
	<lastBuildDate>Fri, 03 Feb 2012 09:05:20 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>Comment on Rapid website development with django and jQuery UI by yelbart</title>
		<link>http://ginstrom.com/scribbles/2010/06/21/rapid-website-development-with-django-and-jquery-ui/comment-page-1/#comment-195696</link>
		<dc:creator>yelbart</dc:creator>
		<pubDate>Fri, 03 Feb 2012 09:05:20 +0000</pubDate>
		<guid isPermaLink="false">http://ginstrom.com/scribbles/?p=1555#comment-195696</guid>
		<description>I&#039;m new to django and i really want this kind of framework using also jquery ui. can i ask a favor then,can you please give me a copy of the site that you made because i want to review on how did you integrate jquery ui to django. thanks a lot...</description>
		<content:encoded><![CDATA[<p>I&#8217;m new to django and i really want this kind of framework using also jquery ui. can i ask a favor then,can you please give me a copy of the site that you made because i want to review on how did you integrate jquery ui to django. thanks a lot&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Version 2.1 of Count Anything released by kiran</title>
		<link>http://ginstrom.com/scribbles/2011/02/20/version-2-1-of-count-anything-released/comment-page-1/#comment-192018</link>
		<dc:creator>kiran</dc:creator>
		<pubDate>Fri, 27 Jan 2012 13:40:22 +0000</pubDate>
		<guid isPermaLink="false">http://ginstrom.com/scribbles/?p=1738#comment-192018</guid>
		<description>For some files, there is difference between the UI output and command prompt output. Is there any reason why it is?</description>
		<content:encoded><![CDATA[<p>For some files, there is difference between the UI output and command prompt output. Is there any reason why it is?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on How do you know how much to charge as a freelance translator? by Fadi El-Eter</title>
		<link>http://ginstrom.com/scribbles/2008/10/01/how-do-you-know-how-much-to-charge-as-a-freelance-translator/comment-page-1/#comment-186080</link>
		<dc:creator>Fadi El-Eter</dc:creator>
		<pubDate>Wed, 18 Jan 2012 20:12:44 +0000</pubDate>
		<guid isPermaLink="false">http://ginstrom.com/scribbles/?p=348#comment-186080</guid>
		<description>I guess the rates differ per length of document. I did get a document translated the other day and I paid around 75 cents/word. It was only something like a 200 word document in Chinese and I needed it in English. Paid $150 to do the translation.</description>
		<content:encoded><![CDATA[<p>I guess the rates differ per length of document. I did get a document translated the other day and I paid around 75 cents/word. It was only something like a 200 word document in Chinese and I needed it in English. Paid $150 to do the translation.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on How do you know how much to charge as a freelance translator? by ivan</title>
		<link>http://ginstrom.com/scribbles/2008/10/01/how-do-you-know-how-much-to-charge-as-a-freelance-translator/comment-page-1/#comment-184154</link>
		<dc:creator>ivan</dc:creator>
		<pubDate>Tue, 17 Jan 2012 03:51:52 +0000</pubDate>
		<guid isPermaLink="false">http://ginstrom.com/scribbles/?p=348#comment-184154</guid>
		<description>am living in Mexico and i want to start as a trasnlator Spanihs-english-spanish but i dont know how much can i charge i dont want to charge an excesive quantity of money ($65$ just sounds nuts awesome but nuts) I just want to know how much does a starter like me should charge?</description>
		<content:encoded><![CDATA[<p>am living in Mexico and i want to start as a trasnlator Spanihs-english-spanish but i dont know how much can i charge i dont want to charge an excesive quantity of money ($65$ just sounds nuts awesome but nuts) I just want to know how much does a starter like me should charge?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Fuzzy substring matching with Levenshtein distance in Python by Ryan Ginstrom</title>
		<link>http://ginstrom.com/scribbles/2007/12/01/fuzzy-substring-matching-with-levenshtein-distance-in-python/comment-page-1/#comment-176777</link>
		<dc:creator>Ryan Ginstrom</dc:creator>
		<pubDate>Sat, 07 Jan 2012 17:27:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.ginstrom.com/scribbles/2007/12/01/fuzzy-substring-matching-with-levenshtein-distance-in-python/#comment-176777</guid>
		<description>@McDougall

It makes sense that this is not bi-directional, because it&#039;s for matching substrings. It should be bi-directional if both strings are of the same length, as you pointed out.</description>
		<content:encoded><![CDATA[<p>@McDougall</p>
<p>It makes sense that this is not bi-directional, because it&#8217;s for matching substrings. It should be bi-directional if both strings are of the same length, as you pointed out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Fuzzy substring matching with Levenshtein distance in Python by McDougall</title>
		<link>http://ginstrom.com/scribbles/2007/12/01/fuzzy-substring-matching-with-levenshtein-distance-in-python/comment-page-1/#comment-176182</link>
		<dc:creator>McDougall</dc:creator>
		<pubDate>Fri, 06 Jan 2012 21:22:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.ginstrom.com/scribbles/2007/12/01/fuzzy-substring-matching-with-levenshtein-distance-in-python/#comment-176182</guid>
		<description>I have been playing around with your modification to Levenshtein, and I have found some interesting results.

Most importantly: the original Levenshtein is bi-directional, meaning that Levenshtein(a,b) == Levenshtein(b,a)

However with your modification, this is no longer the case! In other words,  FuzzyLevenshtein(a,b) != FuzzyLevenshtein(b,a)

From my experiments, it appears that FuzzyLevenshtein(a,b) can/will be a lower value when b (the target) has extra characters in its prefix/suffix. 
When a (the source) has the extra characters, then the Fuzzy version is the same as the original.

Does this make sense to you? Do you agree? Thanks!</description>
		<content:encoded><![CDATA[<p>I have been playing around with your modification to Levenshtein, and I have found some interesting results.</p>
<p>Most importantly: the original Levenshtein is bi-directional, meaning that Levenshtein(a,b) == Levenshtein(b,a)</p>
<p>However with your modification, this is no longer the case! In other words,  FuzzyLevenshtein(a,b) != FuzzyLevenshtein(b,a)</p>
<p>From my experiments, it appears that FuzzyLevenshtein(a,b) can/will be a lower value when b (the target) has extra characters in its prefix/suffix.<br />
When a (the source) has the extra characters, then the Fuzzy version is the same as the original.</p>
<p>Does this make sense to you? Do you agree? Thanks!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Version 2.1 of Count Anything released by Ryan Ginstrom</title>
		<link>http://ginstrom.com/scribbles/2011/02/20/version-2-1-of-count-anything-released/comment-page-1/#comment-176046</link>
		<dc:creator>Ryan Ginstrom</dc:creator>
		<pubDate>Fri, 06 Jan 2012 17:39:22 +0000</pubDate>
		<guid isPermaLink="false">http://ginstrom.com/scribbles/?p=1738#comment-176046</guid>
		<description>@kiran

Had to look up &quot;vbc.&quot; Count Anything counts &quot;all characters&quot; and &quot;characters (no spaces)&quot;. The second measure is the total character count minus the number of characters for which &lt;code&gt;c.isspace()&lt;/code&gt; returns &lt;code&gt;true&lt;/code&gt;.</description>
		<content:encoded><![CDATA[<p>@kiran</p>
<p>Had to look up &#8220;vbc.&#8221; Count Anything counts &#8220;all characters&#8221; and &#8220;characters (no spaces)&#8221;. The second measure is the total character count minus the number of characters for which <code>c.isspace()</code> returns <code>true</code>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Version 2.1 of Count Anything released by kiran</title>
		<link>http://ginstrom.com/scribbles/2011/02/20/version-2-1-of-count-anything-released/comment-page-1/#comment-175874</link>
		<dc:creator>kiran</dc:creator>
		<pubDate>Fri, 06 Jan 2012 13:43:14 +0000</pubDate>
		<guid isPermaLink="false">http://ginstrom.com/scribbles/?p=1738#comment-175874</guid>
		<description>Does the character count calculated by Count Anything include spaces as well as vbc’s ?</description>
		<content:encoded><![CDATA[<p>Does the character count calculated by Count Anything include spaces as well as vbc’s ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Parsing multilingual email with Python by Ryan Ginstrom</title>
		<link>http://ginstrom.com/scribbles/2007/11/19/parsing-multilingual-email-with-python/comment-page-1/#comment-169636</link>
		<dc:creator>Ryan Ginstrom</dc:creator>
		<pubDate>Fri, 30 Dec 2011 15:54:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.ginstrom.com/scribbles/2007/11/19/parsing-multilingual-email-with-python/#comment-169636</guid>
		<description>@Bart Excellent, thanks.</description>
		<content:encoded><![CDATA[<p>@Bart Excellent, thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Parsing multilingual email with Python by Bart Jellema</title>
		<link>http://ginstrom.com/scribbles/2007/11/19/parsing-multilingual-email-with-python/comment-page-1/#comment-163459</link>
		<dc:creator>Bart Jellema</dc:creator>
		<pubDate>Fri, 23 Dec 2011 01:39:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.ginstrom.com/scribbles/2007/11/19/parsing-multilingual-email-with-python/#comment-163459</guid>
		<description>The email field is quite problematic, I think I’ve nailed it now:

&lt;pre&gt;
def get_multilingual_header(header_text, default=”ascii”):
    if not header_text is None:
        try:
            headers = header.decode_header(header_text)
        except HeaderParseError:
            return u”Error”

        try:
            header_sections = [unicode(text, charset if charset and charset!=&#039;unknown&#039; else default, errors=&#039;replace&#039;) for text, charset in headers]
        except LookupError:
            header_sections = [unicode(text, default, errors=&#039;replace&#039;) for text, charset in headers]

        return u”&quot;.join(header_sections)
    else:
        return None

def decode_email(raw_email):
    raw_email = raw_email.replace(‘\r’, ‘ ‘).replace(‘\n’, ‘ ‘).replace(‘ ‘, ‘ ‘)
    if re.match(‘=\?.*?\?[QqBb]\?.*\?=$’, raw_email):
        name, email = utils.parseaddr(get_multilingual_header(raw_email))
    else:
        name, email = utils.parseaddr(raw_email)
        name = get_multilingual_header(name)
        email = get_multilingual_header(email)

    decoded_email = utils.formataddr((name, email))
    return decoded_email
&lt;/pre&gt;

This worked for all my test cases, in particular:

&lt;pre&gt;
print decode_email(‘=?UTF-8?B?5qGD5aSqLCDpg44=?= ‘)
&lt;/pre&gt;

This is how GMail sends it these days and if you decode the whole string first you’re in trouble because decoded it contains a “,”, which means you end up with 2 email addresses</description>
		<content:encoded><![CDATA[<p>The email field is quite problematic, I think I’ve nailed it now:</p>
<pre>
def get_multilingual_header(header_text, default=”ascii”):
    if not header_text is None:
        try:
            headers = header.decode_header(header_text)
        except HeaderParseError:
            return u”Error”

        try:
            header_sections = [unicode(text, charset if charset and charset!='unknown' else default, errors='replace') for text, charset in headers]
        except LookupError:
            header_sections = [unicode(text, default, errors='replace') for text, charset in headers]

        return u”".join(header_sections)
    else:
        return None

def decode_email(raw_email):
    raw_email = raw_email.replace(‘\r’, ‘ ‘).replace(‘\n’, ‘ ‘).replace(‘ ‘, ‘ ‘)
    if re.match(‘=\?.*?\?[QqBb]\?.*\?=$’, raw_email):
        name, email = utils.parseaddr(get_multilingual_header(raw_email))
    else:
        name, email = utils.parseaddr(raw_email)
        name = get_multilingual_header(name)
        email = get_multilingual_header(email)

    decoded_email = utils.formataddr((name, email))
    return decoded_email
</pre>
<p>This worked for all my test cases, in particular:</p>
<pre>
print decode_email(‘=?UTF-8?B?5qGD5aSqLCDpg44=?= ‘)
</pre>
<p>This is how GMail sends it these days and if you decode the whole string first you’re in trouble because decoded it contains a “,”, which means you end up with 2 email addresses</p>
]]></content:encoded>
	</item>
</channel>
</rss>

