Chris Siebenmann
@thatcks


That cks. Overcommitted sysadmin, photographer, bicyclist, and other multitudes. I write a lot of words for a programmer.

Joined December 2011
    Chris Siebenmann @thatcks 20 Oct 2015

    Does anyone know of a good replacement for swish-e for indexing and searching HTML pages? It must still be developed/maintained.

    1:37 PM - 20 Oct 2015

      1. Chris Siebenmann @thatcks 20 Oct 2015

        What we really want to do is take an mbox, make its emails into HTML files, and be able to search for things & get links to the files/etc.

      2. Grant Taylor @drscriptt 20 Oct 2015

        @thatcks dare I say IMAP, including imap:// URLs? Any MUA worth its bits should be able to handle it well.

      3. Chris Siebenmann @thatcks 20 Oct 2015

        @drscriptt We need all of this done through the web, partly so that old messages have (fixed) URLs.

      4. Grant Taylor @drscriptt 20 Oct 2015

        @thatcks is your data an mbox file(s) now? Are you willing to convert to something else?

      5. Chris Siebenmann @thatcks 20 Oct 2015

        @drscriptt Data is mbox files that periodically get new messages (although future new messages could be handled another way).

      1. Chris Siebenmann @thatcks 20 Oct 2015

        Swish-e has been working nicely for us but it's unmaintained and also a horror of 32-bit issues and so unsuitable for a 64-bit world.

      2. Chris Siebenmann @thatcks 20 Oct 2015

        To clarify: we have well under a GB of text to index and search, and we'd like something relatively simple and small and CGI-ish.

      1. rone @rone 20 Oct 2015

        @thatcks Apache Solr?

      2. Chris Siebenmann @thatcks 20 Oct 2015

        @rone That seems kind of heavyweight for us; we have less than 600 MB of text to index.

      1. Grant Taylor @drscriptt 20 Oct 2015

        @thatcks I've used GNOME (2)'s indexer before. I think "Tracker" was its name. It was there, worked, didn't use Java.

      2. Jorge Luis @jorgelbgm 21 Oct 2015

        @thatcks Elasticsearch could also be an option; actually Solr is not that heavy if you disable what you don't need.
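
The goal described in the thread (take an mbox, turn its emails into HTML files with fixed URLs, and search them to get links back) can be sketched roughly as below. This is only an illustration using the Python standard library, not something anyone in the thread proposed or used. The function names (`message_text`, `stable_name`, `build`, `search`), the Message-ID based filename scheme, and the in-memory word index are all assumptions made for the example; it has no ranking, stemming, or persistence.

```python
import hashlib
import html
import mailbox
import re
from collections import defaultdict
from pathlib import Path

WORD_RE = re.compile(r"[a-z0-9]+")


def message_text(msg):
    """Return the first text/plain part of a message as a string ('' if none)."""
    for part in msg.walk():
        if part.get_content_type() == "text/plain":
            payload = part.get_payload(decode=True) or b""
            charset = part.get_content_charset() or "utf-8"
            try:
                return payload.decode(charset, errors="replace")
            except LookupError:  # unknown charset name in the header
                return payload.decode("utf-8", errors="replace")
    return ""


def stable_name(msg, fallback):
    """Hypothetical naming scheme: hash the Message-ID so a message keeps
    the same filename (and so the same URL) every time the mbox is rebuilt."""
    key = str(msg.get("Message-ID") or fallback)
    return hashlib.sha1(key.encode("utf-8", "replace")).hexdigest()[:16] + ".html"


def build(mbox_path, out_dir):
    """Write one HTML file per message and return {word: set of filenames}."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    index = defaultdict(set)
    box = mailbox.mbox(str(mbox_path), create=False)
    try:
        for n, msg in enumerate(box):
            subject = str(msg.get("Subject", "(no subject)"))
            body = message_text(msg)
            name = stable_name(msg, fallback=f"{n}:{subject}")
            page = (
                "<!DOCTYPE html><html><head><meta charset='utf-8'>"
                f"<title>{html.escape(subject)}</title></head><body>"
                f"<h1>{html.escape(subject)}</h1>"
                f"<pre>{html.escape(body)}</pre></body></html>"
            )
            (out / name).write_text(page, encoding="utf-8")
            # Index each distinct word of the subject and body once per file.
            for word in set(WORD_RE.findall((subject + " " + body).lower())):
                index[word].add(name)
    finally:
        box.close()
    return index


def search(index, query):
    """Return sorted filenames that contain every word in the query."""
    words = WORD_RE.findall(query.lower())
    if not words:
        return []
    return sorted(set.intersection(*(index.get(w, set()) for w in words)))
```

A small CGI script could call `search()` and prefix each returned filename with the site's base URL. Because new messages only add files, re-running `build()` on a grown mbox leaves existing filenames unchanged as long as their Message-ID headers are intact.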
