On 6-Sep-07, at 3:25 PM, Mike Klaas wrote:
>
> There are essentially two facet computation strategies:
>
> 1. cached bitsets: a bitset for each term is generated and
> intersected with the query result bitset. This is more general and
> performs well up to a few thousand terms.
>
> 2. field enumeration: cache the field contents, and generate counts
> using this data. Relatively independent of #unique terms, but
> requires at most a single facet value per field per document.
>
> So, if you factor author into Primary author/Secondary author,
> where each is guaranteed to only have one value per doc, this could
> greatly accelerate your faceting. There are probably fewer unique
> subjects, so strategy 1 is likely fine.
>
> To use strategy 2, just make sure that multiValued="false" is set
> for those fields in schema.xml.
I forgot to mention that strategy 2 also requires a single token for
each doc (see
http://wiki.apache.org/solr/FAQ#head-14f9f2d84fb2cd1ff389f97f19acdb6ca55e4cd3)
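
For what it's worth, here is a rough sketch of how the two strategies
differ. It is plain Python rather than Solr's actual Java code, and every
name in it (primary_author, facet_by_bitsets, facet_by_enumeration) is
made up for illustration. It assumes each doc has exactly one
untokenized author value, which is what strategy 2 needs.

```python
from collections import Counter

# Toy "index" (hypothetical data): doc id -> its single author value.
primary_author = {0: "klaas", 1: "smith", 2: "klaas", 3: "jones"}


def facet_by_bitsets(term_to_docs, result_docs):
    """Strategy 1: intersect each term's cached doc set with the result set.

    Does one intersection per unique term, so the work grows with the
    number of unique terms in the field.
    """
    counts = {}
    for term, docs in term_to_docs.items():
        n = len(docs & result_docs)
        if n:
            counts[term] = n
    return counts


def facet_by_enumeration(field_values, result_docs):
    """Strategy 2: look up each result doc's single cached value and count.

    Does one lookup per matching doc, so the work is largely independent
    of the number of unique terms. Only valid with one value per doc.
    """
    return dict(Counter(field_values[doc] for doc in result_docs))


# Build the per-term doc sets that strategy 1 would keep cached.
term_to_docs = {}
for doc, author in primary_author.items():
    term_to_docs.setdefault(author, set()).add(doc)

result_docs = {0, 2, 3}  # docs matching some query
bitset_counts = facet_by_bitsets(term_to_docs, result_docs)
enum_counts = facet_by_enumeration(primary_author, result_docs)
```

Both calls give the same counts here (klaas: 2, jones: 1); the difference
is only in what the loop iterates over, terms in one case and matching
docs in the other.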
-Mike