On Wed, Sep 19, 2007 at 05:55:20PM -0400, Dave Koontz wrote:
> Theo and all. I know this topic comes up on occasion, but I am not sure
> I've ever seen an explanation as to why the bayes_seen file is not auto
> pruned along with the bayes db file. Since tokens expire in the main DB
> file, what is the purpose of having a seen file to unlearn tokens which
> may have long ago been purged? IMO, it would seem logical to also
> purge the seen file at some sort of cycle so it can't grow so
> excessively large.
Sure, patches welcome. :)
Seriously, it would require someone to write the code to deal with
expiry, and to upgrade people's seen files (or otherwise handle that
situation), etc. At a minimum, just adding in a timestamp would help,
but if you wanted to have some mapping of tokens to message, then that's
a whole huge thing. Oh, and you'd need to support SQL and DBM, of course.
Since you can just rm the seen file or do a "delete from" in SQL, and
have everything continue to work, it hasn't been considered a priority.
But if you think it's important enough to get in, we're happy to accept
the patch to implement it.
--
Randomly Selected Tagline:
Forgetfulness, n.:
A gift of God bestowed upon debtors in compensation for
their destitution of conscience.