Package edu.jhu.nlp.wikipedia

Interface Summary
PageCallbackHandler Interface to allow streamed processing of pages.
 

Class Summary
InfoBox A class abstracting Wiki infobox
IteratorHandler  
SAXPageCallbackHandler A Wrapper class for the PageCallbackHandler
WikiPage Data structures for a wikipedia page.
WikiPageIterator A class to iterate the pages after the wikipedia XML file has been parsed with WikiXMLDOMParser.
WikiTextParser For internal use only -- Used by the WikiPage class.
WikiXMLDOMParser A memory efficient parser for easy access to Wikipedia XML dumps in native and compressed XML formats.
Typical pattern of use:
WikiXMLParser  
WikiXMLParserFactory  
WikiXMLSAXParser A SAX Parser for Wikipedia XML dumps.