edu.jhu.nlp.wikipedia
Interface PageCallbackHandler

All Known Implementing Classes:
DemoHandler, DemoSAXHandler, IteratorHandler

public interface PageCallbackHandler

Interface for streamed processing of pages. It enables SAX-style processing of Wikipedia XML files: the registered callback is executed on each page element in the XML file.

Using callbacks consumes less memory, a useful feature for large dumps such as the English and German Wikipedias.

See Also:
WikiXMLDOMParser, WikiPage

Method Summary
 void process(WikiPage page)
          The callback method to implement before registering the handler with WikiXMLDOMParser.
 

Method Detail

process

void process(WikiPage page)
The callback method to implement before registering the handler with WikiXMLDOMParser.

Parameters:
page - a Wikipedia page object
See Also:
WikiPage
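
Example:
The callback pattern described above can be sketched as follows. This is a minimal, self-contained illustration, not the library's own code. The nested WikiPage and PageCallbackHandler types are local stand-ins for the documented ones (only a title accessor is modeled, which is an assumption), and CountingHandler and feed are hypothetical names introduced here. The feed method plays the role of the parser, which would invoke the registered handler once per page element.

```java
import java.util.Arrays;
import java.util.List;

public class CallbackSketch {

    /** Stand-in for the library's WikiPage; only a title is modeled (assumption). */
    static final class WikiPage {
        private final String title;
        WikiPage(String title) { this.title = title; }
        String getTitle() { return title; }
    }

    /** Local copy of the one-method interface documented above. */
    interface PageCallbackHandler {
        void process(WikiPage page);
    }

    /** Hypothetical handler: keeps running totals and never stores a page. */
    static final class CountingHandler implements PageCallbackHandler {
        private int pages;
        private int longestTitle;

        @Override
        public void process(WikiPage page) {
            pages++;
            longestTitle = Math.max(longestTitle, page.getTitle().length());
        }

        int pageCount() { return pages; }
        int longestTitleLength() { return longestTitle; }
    }

    /** Hypothetical driver standing in for the parser: one callback per page. */
    static void feed(Iterable<WikiPage> pages, PageCallbackHandler handler) {
        for (WikiPage page : pages) {
            handler.process(page);
        }
    }

    public static void main(String[] args) {
        List<WikiPage> dump = Arrays.asList(new WikiPage("Berlin"), new WikiPage("Java"));

        // Each page is handed to the handler and can be discarded afterwards.
        CountingHandler counter = new CountingHandler();
        feed(dump, counter);
        System.out.println(counter.pageCount() + " pages, longest title has "
                + counter.longestTitleLength() + " characters");

        // The interface has a single method, so a lambda also works as a handler.
        feed(dump, page -> System.out.println(page.getTitle()));
    }
}
```

Because the handler only keeps two counters, memory use stays constant regardless of how many pages are fed to it, which is the benefit the interface description attributes to callbacks.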