Package com.googlecode.gaal.preprocess.impl

Examples of com.googlecode.gaal.preprocess.impl.LowerCaseNormalizer


    }

    public static Corpus<String> createSmallCorpus() throws FileNotFoundException {
        FileReader reader = new FileReader("data/tlg.txt");

        Tokenizer<String> tokenizer = new RegexTokenizer(reader, Analyser.STRING_REGEX, new LowerCaseNormalizer());
        return new TreeMapCorpus(tokenizer, SEPARATORS);
    }
View Full Code Here


    }

    public static Corpus<String> createLargeCorpus() throws FileNotFoundException {
        FileReader reader = new FileReader("data/moby.txt");

        Tokenizer<String> tokenizer = new RegexTokenizer(reader, Analyser.STRING_REGEX, new LowerCaseNormalizer());
        return new TreeMapCorpus(tokenizer, SEPARATORS);
    }
View Full Code Here

TOP

Related Classes of com.googlecode.gaal.preprocess.impl.LowerCaseNormalizer

Copyright © 2018 www.massapicom. All rights reserved.
All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.