
HTML tag stripper

Submitted on: 1/3/2015 9:23:00 AM
By: John Hunsley (from psc cd)  
Level: Beginner
User Rating: By 5 Users
Compatibility: Java (JDK 1.2)
Views: 1605
 
     Reads in an HTML file (text.txt in the code), removes the tags, and writes the remaining text to output.txt.
 
code:
				
//**************************************
// Name: HTML tag stripper
// Description: reads in an HTML file, removes the tags and writes the result to output.txt
// By: John Hunsley (from psc cd)
//
// Inputs: an HTML file (the code reads "text.txt" from the working directory)
//
// Returns: a text file (output.txt)
//
// Assumes: JDK 1.4 or later (String.replaceAll was added in 1.4)
//**************************************

/*
 *This program reads in an HTML file (text.txt),
 *strips everything that looks like a tag,
 *then writes the remaining text to
 *an output text file (output.txt).
 *@author John Hunsley
 *@version 1.0 25/02/03
 *
*/
import java.io.*;
import javax.swing.JOptionPane;//needed for the confirmation dialog shown after writing
class HtmlStripper
{
	public static void main(String args[])
	{
		StringBuffer fileAsStringBuffer = new StringBuffer();//string buffer for file input
		String fileAsString = "";//the file as a string; stays empty if the read fails
				
		/*****************read the whole file into a StringBuffer, line by line********************/
		try{
			
			BufferedReader input = new BufferedReader(
									 new FileReader("text.txt"));
			String line;
			while((line = input.readLine()) != null)
			{
				//readLine() drops the line terminator, so put one back to keep words on adjacent lines apart
				fileAsStringBuffer.append(line).append('\n');
			}
			input.close();//close the reader
			
			System.out.println("file read into string");
			
			fileAsString = fileAsStringBuffer.toString();//copy the buffer contents into a String
		
		}
		catch(IOException e){
			System.out.println("IO Exception occurred while reading text.txt: " + e.getMessage());
		}
		/************strip tags******************/
		//(?s) lets '.' match line breaks, so tags that span several lines are removed too
		fileAsString = fileAsString.replaceAll("(?s)<.*?>","");//strips all html tags
		//write the String out to a text file
		try{
			
			File file = new File("output.txt");
			PrintWriter output = new PrintWriter(new FileWriter(file));
			output.println(fileAsString);
			output.close();
			System.out.println("String written to output text file");
			JOptionPane.showMessageDialog(null,
							 "String written to output text file");
		}
		catch(IOException e){
			System.out.println("IO Exception occurred while writing output.txt: " + e.getMessage());
		}
	
	}
}
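
The tag-stripping step can also be tried on its own, without any files. The following is only a minimal sketch: the class name HtmlStripDemo and the method stripTags are made up for illustration and are not part of the submission. It assumes the same approach as the code above (a non-greedy regular expression that deletes anything between < and >), with the (?s) flag added so that a tag broken across lines is still matched.

```java
import java.util.regex.Pattern;

// Hypothetical demo class, not part of the original submission.
class HtmlStripDemo {
    // (?s) turns on DOTALL so '.' also matches line breaks inside a tag;
    // .*? is non-greedy, so each match stops at the first '>' after a '<'.
    private static final Pattern TAG = Pattern.compile("(?s)<.*?>");

    // Removes anything that looks like <...>.
    // Character entities such as &amp; are left untouched.
    static String stripTags(String html) {
        return TAG.matcher(html).replaceAll("");
    }

    public static void main(String[] args) {
        String sample = "<p>Hello <b>world</b></p>\n<a\n href=\"x.html\">link</a>";
        // The second tag spans two lines and is still removed.
        System.out.println(stripTags(sample));
    }
}
```

A regular expression like this is fine for simple pages, but it does not understand HTML: a '>' inside an attribute value ends the match early, and the text inside script or style elements is kept because only the tags themselves are removed. For anything beyond simple input, a real HTML parser is likely the safer choice.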

