lucenenet-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Itamar Syn-Hershko <ita...@code972.com>
Subject Re: Lucene.NET 4.8 - a pre-release introduction (video + code)
Date Wed, 14 Dec 2016 19:39:51 GMT
No, analyzers are plain-text processors. You will need to transform binary
formats to plain text yourself. Tika is a great starting point.

--

Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Lucene.NET committer and PMC member

On Wed, Dec 14, 2016 at 5:22 PM, Francesco Abbruzzese <
frankabbruzzese@gmail.com> wrote:

> Hi Itamar ,
>
> Are there analyzers (or text filters)  for all more common type of
> documents,ie word, pdf, ppt, etc one may use with the new lucene .net
> version?
>
>
> 2016-12-13 13:59 GMT+01:00 Itamar Syn-Hershko <itamar@code972.com>:
>
>> We are about to release Lucene.NET 4.8, and it's time to show what it can
>> do, and how it can be done.
>>
>> I just published a walkthrough video on Channel 9, you can watch it here:
>> https://channel9.msdn.com/Blogs/MVP-VisualStudio-Dev/LuceneN
>> ET-48-a-pre-release-introduction
>>
>> The Demo application can be found at https://github.com/synhershko/
>> LuceneNetDemo
>>
>> nuget packages can be downloaded from https://myget.org/gallery/luce
>> ne-net
>>
>> Comments? questions? reach out to us on our mailing lists:
>> http://lucenenet.apache.org/community.html
>>
>> Enjoy!
>>
>> --
>>
>> Itamar Syn-Hershko
>> http://code972.com | @synhershko <https://twitter.com/synhershko>
>> Freelance Developer & Consultant
>> Lucene.NET committer and PMC member
>>
>>
>
>
> --
> Francesco Abbruzzese
> francesco@dotnet-programming.com
> http://www.dotnet-programming.com/
> https://github.com/MvcControlsToolkit
> http://mvccontrolstoolkit.codeplex.com/
>
>
>
>
>
>
>
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message