What we really want to do is take an mbox, turn its emails into HTML files, and be able to search for things & get links to the files/etc.
Does anyone know of a good replacement for swish-e for indexing and searching HTML pages? It must still be actively developed/maintained.
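As a rough illustration of the mbox-to-HTML step, here is a minimal sketch using only Python's standard library. It is not what anyone in the thread actually runs; the function names, the hashed-Message-ID file naming, and the bare-bones HTML layout are all assumptions made for the example.

```python
import hashlib
import html
import mailbox
from pathlib import Path


def message_filename(msg, key):
    """Derive a file name from the Message-ID so the URL stays fixed.

    Assumption: Message-IDs are unique. Messages without one fall back
    to their position in the mbox, which is only stable if the mbox is
    append-only.
    """
    msg_id = str(msg.get("Message-ID") or f"no-id-{key}")
    digest = hashlib.sha1(msg_id.encode("utf-8", "replace")).hexdigest()[:16]
    return digest + ".html"


def body_text(msg):
    """Return the first text/plain part as a string ('' if there is none)."""
    for part in msg.walk():
        if part.get_content_type() != "text/plain":
            continue
        payload = part.get_payload(decode=True)
        if payload is None:
            continue
        charset = part.get_content_charset() or "utf-8"
        try:
            return payload.decode(charset, "replace")
        except LookupError:  # unknown charset label in the message
            return payload.decode("utf-8", "replace")
    return ""


def mbox_to_html(mbox_path, out_dir):
    """Write one HTML file per message and return the paths written."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for key, msg in box.iteritems():
            subject = html.escape(str(msg.get("Subject") or "(no subject)"))
            sender = html.escape(str(msg.get("From") or ""))
            body = html.escape(body_text(msg))
            page = (
                "<!DOCTYPE html>\n<html><head><meta charset=\"utf-8\">"
                f"<title>{subject}</title></head>\n<body>"
                f"<h1>{subject}</h1><p>From: {sender}</p>"
                f"<pre>{body}</pre></body></html>\n"
            )
            path = out / message_filename(msg, key)
            path.write_text(page, encoding="utf-8")
            written.append(path)
    finally:
        box.close()
    return written
```

Calling `mbox_to_html("list.mbox", "archive/")` (both paths hypothetical) would leave one file per message, and re-running it after new mail arrives would regenerate existing messages under the same names.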
@thatcks Dare I say IMAP, including imap:// URLs? Any MUA worth its bits should be able to handle it well.
@drscriptt We need all of this done through the web, partly so that old messages have (fixed) URLs.
@thatcks Is your data in mbox file(s) now? Are you willing to convert to something else?
@drscriptt Data is mbox files that periodically get new messages (although future new messages could be handled another way).
Swish-e has been working nicely for us but it's unmaintained and also a horror of 32-bit issues and so unsuitable for a 64-bit world.
To clarify: we have well under a GB of text to index and search, and we'd like something relatively simple and small and CGI-ish.
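To give a sense of how small "simple and small" can be at this data size, here is a toy inverted index in plain Python. It is only a sketch of the general technique, not a swish-e replacement or any tool suggested in the thread; `build_index`, `search`, and the URL-to-text input format are hypothetical names chosen for the example.

```python
import re
from collections import defaultdict

# Assumption: lowercase ASCII words and digits are good enough as tokens.
TOKEN = re.compile(r"[a-z0-9]+")


def build_index(docs):
    """Map each word to the set of URLs whose text contains it.

    `docs` is a dict of {url: plain_text}.
    """
    index = defaultdict(set)
    for url, text in docs.items():
        for word in TOKEN.findall(text.lower()):
            index[word].add(url)
    return index


def search(index, query):
    """Return the sorted URLs that contain every word in the query."""
    words = TOKEN.findall(query.lower())
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)
```

A CGI script could load a saved index, call `search`, and print the matching URLs as links; ranking, stemming, and phrase search are all left out here.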
@thatcks I've used GNOME (2)'s indexer before. I think "Tracker" was its name. It was there, it worked, and it didn't use Java.
@thatcks Elasticsearch could also be an option. Actually, Solr is not that heavy if you disable what you don't need.
Chris Siebenmann
Grant Taylor
rone
Jorge Luis