I’ve been refreshing my string algorithm knowledge recently, so as an exercise I started a new project on GitHub. I’ll probably abandon it as soon as I start regular work again, but anyway, it’s a demo of how file indexing could look if we had tons of RAM available to keep an enhanced suffix array in memory for fast exact substring queries. I’ve also written a cute little text-mode console client using the Python curses library.
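
For a rough idea of what an exact substring query over a suffix array looks like, here is a minimal sketch. It is not the project's code: it uses a plain suffix array with a naive construction rather than the enhanced variant, and the names `build_suffix_array` and `find_occurrences` are made up for illustration.

```python
from typing import List


def build_suffix_array(text: str) -> List[int]:
    """Return the start positions of all suffixes of text, in sorted order.

    Naive construction (sorting by the suffix strings themselves); fine for
    a demo, far too slow and memory-hungry for real file indexing.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: List[int], pattern: str) -> List[int]:
    """Return every position where pattern occurs in text, in ascending order.

    All suffixes that start with pattern form one contiguous block in the
    suffix array, so two binary searches locate the block's boundaries.
    """
    m = len(pattern)

    # Lower bound: first suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(sa[start:lo])


text = "banana"
sa = build_suffix_array(text)
print(sa)                                 # [5, 3, 1, 0, 4, 2]
print(find_occurrences(text, sa, "ana"))  # [1, 3]
```

Each query costs O(m log n) character comparisons for a pattern of length m in a text of length n, which is why keeping the whole array in RAM makes lookups fast.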