SemaText versus BasisTech

View: New views
3 Messages — Rating Filter:   Alert me  

SemaText versus BasisTech

by silke :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hi,

We are looking for multi lingual search support - German, English, French in a single Solr instance. I have been told that we need to use third party products like Sematext and BasisTech. Which one would you recommend we use between the two of them? Also, what are the pros an cons of using these products.

Any pointers would be greatly helpful.

Thanks
Silke

Re: SemaText versus BasisTech

by Hannes Carl Meyer-2 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hi Silke,

what do you mean with multi lingual search support?

Remember, with the MultiCore (http://wiki.apache.org/solr/CoreAdmin) you can
have multiple indexes and configurations for preprocessing in one Solr
instance. I used it a lot for such cases with multiple language support.

Bests

Hannes

On Wed, Nov 4, 2009 at 4:55 PM, silke <steinbachsilke@...> wrote:

>
> Hi,
>
> We are looking for multi lingual search support - German, English, French
> in
> a single Solr instance. I have been told that we need to use third party
> products like Sematext and BasisTech. Which one would you recommend we use
> between the two of them? Also, what are the pros an cons of using these
> products.
>
> Any pointers would be greatly helpful.
>
> Thanks
> Silke
> --
> View this message in context:
> http://old.nabble.com/SemaText-versus-BasisTech-tp26198959p26198959.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>

Parent Message unknown Re: SemaText versus BasisTech

by Otis Gospodnetic :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hello Silke,

If you need to index/search content in multiple language, you do need to handle each language (both documents and queries) differently and correctly for each language.  Sematext does offer an affordable multilingual indexer product and language identifier - http://www.sematext.com/products/multilingual-indexer/index.html .  BasisTech has more advanced linguistic support (things like lemmatization, morphological analysis...), but this is also several orders of magnitude more expensive.  Have a look at http://www.basistech.com/.../Lucene-Solr-for-the-Rest-of-the-World.pdf (although it looks like some of the statements about Lucene are no longer true).

Otis
--
Sematext is hiring -- http://sematext.com/about/jobs.html?mls
Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR



----- Original Message ----

> From: silke <steinbachsilke@...>
> To: solr-user@...
> Sent: Wed, November 4, 2009 10:55:12 AM
> Subject: SemaText versus BasisTech
>
>
> Hi,
>
> We are looking for multi lingual search support - German, English, French in
> a single Solr instance. I have been told that we need to use third party
> products like Sematext and BasisTech. Which one would you recommend we use
> between the two of them? Also, what are the pros an cons of using these
> products.
>
> Any pointers would be greatly helpful.
>
> Thanks
> Silke
> --
> View this message in context:
> http://old.nabble.com/SemaText-versus-BasisTech-tp26198959p26198959.html
> Sent from the Solr - User mailing list archive at Nabble.com.