On 05/11/2013 17:30, SATOH Fumiyasu wrote:
At Tue, 5 Nov 2013 08:10:46 +0100 (CET), Steffen Kaiser wrote:
The program has at least 2 bugs in it:
. If the body has a paragraph break (i.e., '\n') followed by the RFC822 keyword 'From', the original message will lose its last half and a phantom message will be created.
Change from my notes:
if ( /^From / )
-to-
if ( /^From .*? \d\d:\d\d:\d\d \d\d\d\d/ )
. I never could figure out where the second bug was. This one created some messages with blank subject lines.
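To make the effect of the first change concrete, here is a minimal sketch in Python (not the original Perl program) that splits mbox text with either pattern. The names `LOOSE`, `STRICT`, `split_mbox` and the sample data are made up for illustration; only the two regular expressions come from the notes above.

```python
import re

# Loose test from the notes: any line starting with "From ".
LOOSE = re.compile(r"^From ")
# Stricter test from the notes: "From " followed later by hh:mm:ss and a 4-digit year.
STRICT = re.compile(r"^From .*? \d\d:\d\d:\d\d \d\d\d\d")


def split_mbox(text, separator=STRICT):
    """Split mbox text into messages, starting a new one at each separator line."""
    messages = []
    current = []
    for line in text.splitlines(keepends=True):
        # A separator line closes the message collected so far.
        if separator.match(line) and current:
            messages.append("".join(current))
            current = []
        current.append(line)
    if current:
        messages.append("".join(current))
    return messages


# Hypothetical sample: the first body contains an unquoted line starting with "From ".
SAMPLE = (
    "From alice@example.org Tue Nov  5 08:10:46 2013\n"
    "Subject: first\n"
    "\n"
    "Some text.\n"
    "\n"
    "From what I can tell, this line is body text.\n"
    "\n"
    "From bob@example.org Tue Nov  5 17:30:00 2013\n"
    "Subject: second\n"
    "\n"
    "Bye.\n"
)

print(len(split_mbox(SAMPLE, LOOSE)))   # loose pattern also splits at the body line
print(len(split_mbox(SAMPLE, STRICT)))  # strict pattern splits only at real separators
```

With the loose pattern the body line is taken for a separator, so the first message is cut short and an extra message without any headers appears. The strict pattern is still a heuristic: a body line that happens to contain a time and a year after "From " would match as well.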
You have a badly formatted mbox file, if such a distinction is necessary:
No. There are some variants of mbox format. See https://en.wikipedia.org/wiki/Mbox#Family
RFC 4155
o Each message in the mbox database MUST be immediately preceded by a single separator line, which MUST conform to the following syntax:
The exact character sequence of "From";
a single Space character (0x20);
the email address of the message sender (as obtained from the
message envelope or other authoritative source), conformant
with the "addr-spec" syntax from RFC 2822;
http://manpages.ubuntu.com/manpages/precise/man5/mbox.5.html
"In order to avoid misinterpretation of lines in message bodies which begin with the four characters "From", followed by a space character, the mail delivery agent must quote any occurrence of "From " at the start of a body line."
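The quoting that the man page asks of the delivery agent can be sketched as follows. The excerpt above does not say how the quoting is done; this sketch assumes the common convention of prefixing such lines with '>', and the function name `quote_from_lines` is hypothetical.

```python
def quote_from_lines(body):
    """Prefix '>' to every body line that starts with 'From '.

    Assumption: '>' is the quote character; the excerpt quoted above
    does not specify it.
    """
    quoted = []
    for line in body.splitlines(keepends=True):
        if line.startswith("From "):
            line = ">" + line
        quoted.append(line)
    return "".join(quoted)


print(quote_from_lines("Hi,\n\nFrom here on it gets tricky.\n"))
```

A body quoted this way no longer contains lines that begin with "From ", so even the simple /^From / test would not split it in the wrong place.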