Examples of TrecDoc


Examples of uk.ac.ucl.panda.indexing.io.TrecDoc

   
               
    Document doc =new Document();

   
    TrecDoc docMaker = new TrecDoc(data);
      

       appProp.setProperty("doc.maker.forever", "false");
   
       Config config = new Config(appProp);
       docMaker.setConfig(config);
       HTMLParser htmlParser = (HTMLParser) Class.forName(config.get("html.parser","uk.ac.ucl.panda.applications.demo.DemoHTMLParser")).newInstance();
       docMaker.setHTMLParser(htmlParser);
      
       IndexWriter writer = new IndexWriter(indexDir,
          new PorterStemAnalyzer(), true);
      writer.setUseCompoundFile(false);
     
     long start = new Date().getTime();
       while ((doc = docMaker.makeDocument()) != null) {
         writer.addDocument(doc);// add Document to index   
       }

   
   
View Full Code Here
TOP
Copyright © 2018 www.massapi.com. All rights reserved.
All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.