JAVA Problem In this exercise, you will create a simple spell checker program. Y
ID: 3816864 • Letter: J
Question
JAVA Problem
In this exercise, you will create a simple spell checker program. Your program will have the following characteristics:
Your program will be capable of reading multiple files to be spell checked from command line arguments.
When your program starts up, you will read in the dictionary.txt file which contains a list of known words. Only read in this file once. Store these words in a HashSet data structure.
Create a TreeSet to store any miss-spelled words that you find.
For each file in the command line parameter list, print out the name of the file before reading in any words.
For each file parse out each of the words in the file and check to see if the word is in the HashSet you created with the dictionary.txt words or its already in your list of miss-spelled words.
If the word is in the dictionary HashSet or the miss-spelled word TreeSet then proceed to the next word.
If the word is not in the dictionary or the miss-spelled word list, give your user 2 choices: Add the word to the HashSet dictionary, or add the word to the list of miss-spelled words.
At the end of each file processed, dump out the list of miss-spelled words.
When you then start to read in the next file, make sure you clear the list of miss-spelled words. Keep any words added to the dictionary HashSet.
More details:
In parsing the words in these text files, you will need to filter out punctuation characters. I used the StringTokenizer, and I called the following constructor:
StringTokenizer st = new StringTokenizer(line, " ,.;:-%'"");
before checking the word against the dictionary, make sure you lower case the word
if the "word" doesn't start out with a letter (i.e. >= 'a' And <= 'z'), then skip it because it is likely a number
If the word couldn't be found in either list and ends with an 's', try removing the 's' to create a singular version of the word and try again to find the word in your dictionary or miss-spelled word list using the singular version of the word.
Only print out lines that contain a word needing a user response. See the Sample output below from the published answer.
Use the following dictionary file
dictionary.txt
You can find the input files to test from the following links:
Russia.txt
Ukraine.txt
Your command line arguments will be:
Russia.txt Ukraine.txt
When you execute your program to publish your output, answer "n" to the question of whether to add the word to the dictionary for all words except the for the word "christianity" .
See the sample output below:
Some of the output from running this program should look something like:
First part of the output from running the published answer
For those of you who like to see a template of the published answer, here it is:
Explanation / Answer
//SpellCheck.java
import java.io.File;
import java.io.FileNotFoundException;
import java.util.HashSet;
import java.util.Iterator;
import java.util.Scanner;
import java.util.StringTokenizer;
import java.util.TreeSet;
public class SpellCheck {
private HashSet<String> dictionary = new HashSet<String>();
private TreeSet<String> miss_spelled_words = new TreeSet<String>();
public SpellCheck() throws FileNotFoundException
{
try (Scanner sc = new Scanner(new File("dictionary.txt"))){
while (sc.hasNextLine()) {
String line = sc.nextLine();
dictionary.add(line);
}
sc.close();
}
}
public void checkSpelling(String fileName) throws FileNotFoundException
{
System.out.println("======== Spell checking " + fileName + " =========");
miss_spelled_words.clear();
String line = "";
int lineNumber = 1;
Scanner input = new Scanner(new File(fileName));
Scanner userInput = new Scanner(System.in);
while (input.hasNextLine()){
line = input.nextLine();
StringTokenizer st = new StringTokenizer(line, " () ,.;:-%'"");
while (st.hasMoreTokens())
{
String word = st.nextToken();
word = word.toLowerCase();
if(Character.isLetter(word.charAt(0)))
{
if (dictionary.contains(word)==false && miss_spelled_words.contains(word)==false)
{
if (word.charAt(word.length() - 1) == 's')
{
word = word.substring(0, word.length() -1);
}
System.out.println(lineNumber + " : " + line);
lineNumber += 1;
System.out.println(word + " not in dictionary. Add to dictionary? (y/n)");
char command = userInput.next().charAt(0);
if (command == 'y'){
dictionary.add(word);
System.out.println(word + " has been added to the dictionary.");
System.out.println();
}
else{
System.out.println();
miss_spelled_words.add(word);
}
}
}
}
}
input.close();
}
public void dump_miss_spelled_words()
{
Iterator iterator = miss_spelled_words.iterator();
while (iterator.hasNext())
System.out.println(iterator.next());
}
public static void main(String[] args) {
try {
SpellCheck spellCheck = new SpellCheck();
for (int i=0; i < args.length; i++)
{
spellCheck.checkSpelling(args[i]);
spellCheck.dump_miss_spelled_words();
}
}
catch (FileNotFoundException e)
{
System.out.println(e);
}
}
}
================================================================
Related Questions
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.