manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Cihad Guzel <cguz...@gmail.com>
Subject extract email attachment
Date Tue, 07 Feb 2017 17:56:59 GMT
Hi,

I try the email connector with gmail. I attach the file [1] in my new
email. And sent to my test email adress.

My mail content body is like: "this is test mail for mfc"

Then I run my email job and the email is indexed to Solr successfully. But,
the solr's content field have not my attachment's content body. Solr
content filed looks like:

"content":" \n \n  \n  \n  \n  \n  \n  \n  \n \n
 --94eb2c1910841bc55f0547f43443\r\nContent-Type: multipart/alternative;
boundary=94eb2c1910841bc5530547f43441\r\n\r\n--94eb2c1910841bc5530547f43441\r\nContent-Type:
text/plain; charset=UTF-8\r\n\r\nthis is test mail for
mfc.\r\n\r\n--94eb2c1910841bc5530547f43441\r\nContent-Type: text/html;
charset=UTF-8\r\n\r\n<div dir=\"ltr\">this is test mail for
mfc.\r\n</div>\r\n\r\n--94eb2c1910841bc5530547f43441--\r\n--94eb2c1910841bc55f0547f43443\r\nContent-Type:
application/pdf; name=\"pdf-test.pdf\"\r\nContent-Disposition: attachment;
filename=\"pdf-test.pdf\"\r\nContent-Transfer-Encoding:
base64\r\nX-Attachment-Id:
f_iyvt78qa0\r\n\r\nJVBERi0xLjYNJeLjz9MNCjM3IDAgb2JqIDw8L0xpbmVhcml6ZWQgMS9MIDIwNTk3L08gNDAvRSAx\r\nNDExNS9OIDEvVCAxOTc5NS9IIFsgMTAwNSAyMTVdPj4NZW5kb2JqDSAgICAgICAgICAgICAgICAg\r\nDQp4cmVmDQozNyAzNA0KMDAwMDAwMDAxNiAwMDAwMCBuDQowMDAwMDAxMzg2IDAwMDAwIG4NCjAw\r\nMDAwMDE1MjIgMDAwM
..."

Does the MFC email connector know that the attachment's file type is pdf?
Does not extract the contents?

[1] http://www.orimi.com/pdf-test.pdf
-- 
Regards
Cihad G├╝zel

Mime
View raw message