Posts Tagged ‘document management’

KnowledgeTree 3.7.0.2 Document Indexing and Indexer issues (SOLVED)


2010
03.22

Argh. Well, it’s really not quite out of the box, at least on Ubutu Server 9.10 (after reinstallations required after initial failures…), so this is just a selection of the fixes that made the import and indexing of the 160 000 files at 102GB possible. So for KnowledgeTree 3.7.0.2 Commercial Edition (the same holds true for the Community Edition), the following should help:

  1. Use the best-practice advice when doing the local file system import – rather do 10 000 files at a time rather than 100 000 at once. Really. Trust me. It defeats the whole idea of just running a batch job. Completely. As you’d expect the option of saying – just transfer all data in directory X. But alas, that doesn’t work. So do it in batches. Manually.
  2. Tika Apache Indexer for Lucene – not so much on PDFs, Docs, XLS or PPT files. Install catdoc (which includes catppt and xls2csv) and pdftotext (which you’ll find in xpdf-utils).
    1. apt-get install catdoc pdftotext
    2. modify knowledgetree/search2/indexing/extractors/TikaApacheExtractor.inc.php and comment out the mime types that are affected above from the returned array in getSupportedMimeTypes() – PDF, XLS, DOC and PPT: (more…)

Knowledge Tree 3.7 Community Edition released


2009
11.25

PBGVVH25US66 They released the Knowledge Tree 3.7 RC Community Edition some time ago, which now runs on the full Zend-Server stack! Infrastructurally, this is the biggest changed; PHP-based setup wizards also feature, and sure – you can now run it on IIS6/IIS7, but that switch to the Zend stack with use of Optimiser features most highly for me on this development. But you knew that already! :)

Curious to see the feedback from Steve Briggs from iBurst about their KnowledgeTree (not necessarily their whole CRM solution) usage and implementation experience at iBurst, after the chat I had with him yesterday…