[Dovecot] fts-solr plugin issue (Marked invalid)

Nikolai Derzhak nikolai at 6zap.com
Mon May 4 17:16:06 EEST 2009


OK. Concentrating problem in one question.
How to ignore "bad" message and index next one in indexing procedure (fts plugin) ?.
Now, one "error 500" from solr and dovecot (# 1.1.11: /etc/dovecot/dovecot.conf
# OS: Linux 2.6.21.7-2.fc8xen i686 Ubuntu 8.04.2 ext3
) stop and each next search query repeat the story.
I've explored fts, and ftp-solr directories in src, without success for now.
Timo, you understand code much bettter, can you help me and point to place in code,
or probably create some patch, if possible ?.

On Fri, 05/01/2009 at 5:56pm, "Nikolai Derzhak" <nikolai at 6zap.com> wrote:
> Hi.
> 
> We user ftp-solr plugin and have problem with solr-1.3+
> with HTMLStripWhitespaceTokenizerFactory (Solr schema in attachments).
> In some maildir's present messages with wrong "Content-Type: " fields in 
> attachments.
> For example:
> "
> Content-Type: TEXT/mspowerpoint; name="Zapatec_6zap_netvibes_1.ppt"
> "
> Indexing for this messages is stop with "fts_solr: Indexing failed: 500 
> Internal Server Error".
> In solr log is:
> "
> SEVERE: java.io.IOException: Mark invalid
> at java.io.BufferedReader.reset(BufferedReader.java:485)
> "
> (mail list with discussion: http://markmail.org/message/2fnfiwygvehjngyr#
> query:SEVERE%3A%20java.io.IOException%3A%20Mark%20invalid%20lucene+page:1+
> mid:2fnfiwygvehjngyr+state:results)
> 
> Look's like dovecot try to index attachments like this.
> Also for some messages we have same error.
> Dovecot stop indexing of box and each search we have lag and CPU load on 
> server.
> 
> So we need to make dovecot more "stable" to this error.
> For first time , will be good, just ignore problematic messages with error 
> from solr.
> 
> Let's discuss this issue, because this is general problem.
> We ready to explore code where needed , etc.
> 
> Regards,
> Nikolai
> 
> Powered by the 6zap. Sign up at http://www.6zap.com for an account that 
> provides advanced e-mail, calendar and contacts capabilities.

Powered by the 6zap. Sign up at http://www.6zap.com for an account that provides advanced e-mail, calendar and contacts capabilities.


More information about the dovecot mailing list