[Mailman-Users] Mime conversions - missing carriage returns andoddcharacters

Mark Sapiro msapiro at value.net
Mon Mar 5 18:53:29 CET 2007


Ryan Steele wrote:

>Mark Sapiro wrote:
>> Ryan Steele wrote:
>>   
>>> Mark Sapiro wrote:
>>>     
>>>> Do you have convert_html_to_plaintext set to Yes?
>>>>   
>>>>       
>>> Affirmative, I do.
>>>     
>>
>>
>> Based on what I see, I don't think that is the problem.


It definitely IS the issue with this one. 
>I'm still working on getting the client to be able to consistently 
>reproduce the character misrepresentation (UTF-8) issue, so I won't 
>bother the list with that for now.  However, here's a good example of 
>one with stripped carriage returns.  Sorry for the delay.  I didn't post 
>this to the list because I didn't want a search engine spider crawling 
>the information in the header... feel free to post a reply with those 
>snipped!
>
>Before making it to the list (carriage return omission example):
>
>######################################################################################
>
<snip>
>X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9
>Content-type: text/html; charset=us-ascii
<snip>
>
><html>
><body>
>Hello, Ryan<br><br>
>I am writing each line flush left<br><br>
>And, I have inserted a CR between each line<br><br>
>I am copying you on the original<br><br>
>And sending a copy to test2<br><br>
>I think you'll see that the text shifts 2 spaces to the right<br><br>
>And that all the CRs are deleted<br><br>
>And that urls get footnoted.<br><br>
>For instance,
>
><a href="http://www.odnetwork.org/" eudora="autourl">www.odnetwork.org<br>
><br>
></a>Plus the url in my sig block<br><br>
>I am typing bold in <b>bold </b>and italics in <i>italics<br><br>
></i>I hope that helps.<br><br>
>Bill you will also get a copy of this, which you can ignore.<br><br>
>Matt<br>
>
<snip>
</body>
></html>
>
>######################################################################################
>
>After making it to the list (carriage return omission example):
>
>######################################################################################
>
<snip>
>X-Content-Filtered-By: Mailman/MimeDel 2.1.8
<snip>
>Content-Type: text/plain; charset="us-ascii"
>Content-Transfer-Encoding: 7bit
<snip>
>
>
>
>   Hello, Ryan
>   I am writing each line flush left
>   And, I have inserted a CR between each line
>   I am copying you on the original
>   And sending a copy to test2
>   I think you'll see that the text shifts 2 spaces to the right
>   And that all the CRs are deleted
>   And that urls get footnoted.
>   For instance, [1]www.odnetwork.org
>   Plus the url in my sig block
>   I am typing bold in bold and italics in italics
>   I hope that helps.
>   Bill you will also get a copy of this, which you can ignore.
>   Matt
>
<snip>
>
>References
>
>   1. http://www.odnetwork.org/
<snip>


The original post was html only, thus it was converted to plain text.
The indentation and footnoting of hyperlinks is all done by your
HTML_TO_PLAIN_TEXT_COMMAND (default = '/usr/bin/lynx -dump
%(filename)s'). I'm not sure what 'carriage returns' are missing, but
if you're referring to the html that renders as double spaces being
renderd single spaced, that's lynx too. If you prefer a double spaced
rendering, you can put

HTML_TO_PLAIN_TEXT_COMMAND = '/usr/bin/links -dump %(filename)s'

in mm_cfg.py to replace lynx with links (verify that you have links and
that is the correct path). The rest of what links does with this is
very similar to what lynx does.

If in the other cases, we are converting utf-8 html to plain text, I
think that explains why we 'lose' the character set. I think there are
definitely problems in this area. It didn't look like that was the
issue from a previous post, but as the messages were incomplete in
that example, I may have misinterpreted what was happening.

If your client would post plain text or even multipart/alternative, we
wouldn't have to convert the html.

See <http://www.expita.com/nomime.html#eudora5> (hopefully it also
applies to Eudora 7.)

-- 
Mark Sapiro <msapiro at value.net>       The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan



More information about the Mailman-Users mailing list