beagle indexes and cache

View: New views
5 Messages — Rating Filter:   Alert me  

beagle indexes and cache

by Lorenzo Bettini :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hi

does beagle cleans up its indexes and textcaches now and then?
I mean it looks like old documents (now removed) still come up when
searching...

is there a way to clean indexes and textcache, or should I simply remove
these directories in the .beagle folder now and then?

thanks
        Lorenzo

--
Lorenzo Bettini, PhD in Computer Science, DI, Univ. Torino
ICQ# lbetto, 16080134     (GNU/Linux User # 158233)
HOME: http://www.lorenzobettini.it MUSIC: http://www.purplesucker.com
http://www.myspace.com/supertrouperabba
BLOGS: http://tronprog.blogspot.com  http://longlivemusic.blogspot.com
http://www.gnu.org/software/src-highlite
http://www.gnu.org/software/gengetopt
http://www.gnu.org/software/gengen http://doublecpp.sourceforge.net

_______________________________________________
Dashboard-hackers mailing list
Dashboard-hackers@...
http://mail.gnome.org/mailman/listinfo/dashboard-hackers

Re: beagle indexes and cache

by Adam Tauno Williams :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

> does beagle cleans up its indexes and textcaches now and then?
> I mean it looks like old documents (now removed) still come up when
> searching...
> is there a way to clean indexes and textcache, or should I simply remove
> these directories in the .beagle folder now and then?

It should, but I have had the indexes get stale, especially if I worked
for awhile with the beagle service turned off.

It helps to occasionally stop the Beagle service, open a shell,  "export
BEAGLE_EXERCISE_THE_DOG=1", run "beagle-search" (yes - from the shell!),
then start the service.  This will make the service index aggressively
as fast as it can;  bringing the index up to date.  I do this on my
laptop when I go to bed - then in the morning stop the service, close
the shell, run beagle-search from the GUI (GNOME-Do!) and restart the
service [now running in an environment without BEAGLE_EXERCISE_THE_DOG
set] so it is back to running as a low-priority background service.

You can also stop the service, "rm -R ~/.beagle", "export
BEAGLE_EXERCISE_THE_DOG=1", beagle-search, start the service - to
rebuild all your indexes from scratch as fast as it can.
--
openSUSE <http://www.opensuse.org/en/>
Linux for human beings who need to get things done.


_______________________________________________
Dashboard-hackers mailing list
Dashboard-hackers@...
http://mail.gnome.org/mailman/listinfo/dashboard-hackers

signature.asc (204 bytes) Download Attachment

Re: beagle indexes and cache

by D Bera :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

>> does beagle cleans up its indexes and textcaches now and then?
>> I mean it looks like old documents (now removed) still come up when
>> searching...
>> is there a way to clean indexes and textcache, or should I simply remove
>> these directories in the .beagle folder now and then?
>
> It should, but I have had the indexes get stale, especially if I worked
> for awhile with the beagle service turned off.

That is right. If beagle service is running, it will monitor new files
that are created and existing files that are removed almost
immediately. If it is turned off, and then turned on again, then the
changes while it was turned off are handled later:
- new files are reported only during the next re-crawl
- deleted files are not reported but they are removed from the beagle
database much later and only when such a file comes up a search result

In short, deleted files should not come up as a search result (I dont
remember, but there was a bug in 0.3.7 or 0.3.8 which can cause
something like that).

Removing textcache and indexes manually should not be necessary,
assuming there is no bug (and even less reason if beagle is running
continuously). Note that the indexes are separated into backends and
can be easier to remove but the textcache is not, so removing the
textcache directory will wipe out all of textcache.

- dBera
_______________________________________________
Dashboard-hackers mailing list
Dashboard-hackers@...
http://mail.gnome.org/mailman/listinfo/dashboard-hackers

Re: beagle indexes and cache

by Lorenzo Bettini :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

D Bera wrote:

>>> does beagle cleans up its indexes and textcaches now and then?
>>> I mean it looks like old documents (now removed) still come up when
>>> searching...
>>> is there a way to clean indexes and textcache, or should I simply remove
>>> these directories in the .beagle folder now and then?
>> It should, but I have had the indexes get stale, especially if I worked
>> for awhile with the beagle service turned off.
>
> That is right. If beagle service is running, it will monitor new files
> that are created and existing files that are removed almost
> immediately. If it is turned off, and then turned on again, then the
> changes while it was turned off are handled later:
> - new files are reported only during the next re-crawl
> - deleted files are not reported but they are removed from the beagle
> database much later and only when such a file comes up a search result
>
> In short, deleted files should not come up as a search result (I dont
> remember, but there was a bug in 0.3.7 or 0.3.8 which can cause
> something like that).
>
> Removing textcache and indexes manually should not be necessary,
> assuming there is no bug (and even less reason if beagle is running
> continuously). Note that the indexes are separated into backends and
> can be easier to remove but the textcache is not, so removing the
> textcache directory will wipe out all of textcache.
>

thanks for your answers!

I was worried, though, about files being indexed that are in a directory
which is later marked as ignored with beagle-settings: will those files
well be removed from the beagle database?

thanks again
        Lorenzo

--
Lorenzo Bettini, PhD in Computer Science, DI, Univ. Torino
ICQ# lbetto, 16080134     (GNU/Linux User # 158233)
HOME: http://www.lorenzobettini.it MUSIC: http://www.purplesucker.com
http://www.myspace.com/supertrouperabba
BLOGS: http://tronprog.blogspot.com  http://longlivemusic.blogspot.com
http://www.gnu.org/software/src-highlite
http://www.gnu.org/software/gengetopt
http://www.gnu.org/software/gengen http://doublecpp.sourceforge.net

_______________________________________________
Dashboard-hackers mailing list
Dashboard-hackers@...
http://mail.gnome.org/mailman/listinfo/dashboard-hackers

Re: beagle indexes and cache

by D Bera :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

>>>> does beagle cleans up its indexes and textcaches now and then?
>>>> I mean it looks like old documents (now removed) still come up when
>>>> searching...
>>>> is there a way to clean indexes and textcache, or should I simply remove
>>>> these directories in the .beagle folder now and then?
...
>
> I was worried, though, about files being indexed that are in a directory
> which is later marked as ignored with beagle-settings: will those files well
> be removed from the beagle database?

They would definitely not show up in search results - the search
results are filtered based on current availability and beagle-settings
ignore list.
As far as actually removing them from the database is concerned, I
don't remember off the top of my head how it is dealt with. Ideally,
the entries should be removed since the index (and esp. the textcache)
might retain information that the user doesnt want it to contain ...
but ignoring a directory involves traversing all files and directories
beneath it which soon creates a nightmare. I remember seeing some code
on dropping the contents of the ignored directories but I am can't
remember if it is complete or what exactly it does.

- dBera

--
-----------------------------------------------------
Debajyoti Bera @ http://dtecht.blogspot.com
beagle / KDE fan
Mandriva / Inspiron-1100 user
_______________________________________________
Dashboard-hackers mailing list
Dashboard-hackers@...
http://mail.gnome.org/mailman/listinfo/dashboard-hackers