//ParserConvert.java
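package extractors;

import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

import org.htmlparser.Parser;
import org.htmlparser.util.ParserException;
import org.htmlparser.visitors.TextExtractingVisitor;

/** Extracts the plain text of an HTML page with HTMLParser. */
public class ParserConvert
{
    public static void main (String[] args) throws ParserException
    {
        String file = "ta.htm";
        String s = getText(file);
        System.out.println (s);
    }

    /** Returns the text content of the HTML file or URL named by f. */
    public static String getText(String f) throws ParserException
    {
        Parser parser = new Parser (f);
        // The visitor collects the text nodes as the parser walks the page.
        TextExtractingVisitor visitor = new TextExtractingVisitor ();
        parser.visitAllNodesWith (visitor);
        String s = visitor.getExtractedText();
        // Assumes the parser decoded the page as ISO-8859-1 although the bytes
        // are in the platform default charset: recover the bytes and decode again.
        s = new String(s.getBytes(StandardCharsets.ISO_8859_1), Charset.defaultCharset());
        return s;
    }
}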
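
The re-decoding step in getText can be tried on its own. The sketch below is not part of ParserConvert or HTMLParser: the class CharsetRoundTrip and the method redecode are hypothetical names, and UTF-8 is only an assumed page encoding picked for the demonstration. It relies on ISO-8859-1 mapping every byte to exactly one character, so text that was wrongly decoded as ISO-8859-1 can be turned back into the original bytes and decoded again with the right charset.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only.
final class CharsetRoundTrip {

    // Undo a wrong ISO-8859-1 decode: recover the raw bytes, then decode
    // them with the charset the page was actually written in.
    static String redecode(String misdecoded, Charset actual) {
        byte[] raw = misdecoded.getBytes(StandardCharsets.ISO_8859_1);
        return new String(raw, actual);
    }

    public static void main(String[] args) {
        Charset actual = StandardCharsets.UTF_8;   // assumed page encoding
        String original = "\u4e2d\u6587";           // two Chinese characters
        // Simulate a parser that decoded the page bytes as ISO-8859-1.
        String misdecoded = new String(original.getBytes(actual), StandardCharsets.ISO_8859_1);
        System.out.println(redecode(misdecoded, actual).equals(original)); // prints "true"
    }
}
```

Passing the charset explicitly keeps the result independent of the machine the program runs on. If the page encoding is known in advance, telling the parser about it before parsing is likely the cleaner route than repairing the text afterwards.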