javafile-ioutf-8character-encodingwindows-1252

How to read a file in Java with specific character encoding?


I am trying to read a file in as either UTF-8 or Windows-1252 depending on the output of this method:

public Charset getCorrectCharsetToApply() {
    // Returns a Charset for either UTF-8 or Windows-1252.
}

So far, I have:

String fileName = getFileNameToReadFromUserInput();
InputStream is = new ByteArrayInputStream(fileName.getBytes());
InputStreamReader isr = new InputStreamReader(is, getCorrectCharsetToApply());
BufferedReader buffReader = new BufferedReader(isr);

The problem I'm having is converting the BufferedReader instance to a FileReader.

Furthermore:

Thanks in advance!


Solution

  • So, first, as a heads up, do realize that fileName.getBytes() as you have there gets the bytes of the filename, not the file itself.

    Second, reading inside the docs of FileReader:

    The constructors of this class assume that the default character encoding and the default byte-buffer size are appropriate. To specify these values yourself, construct an InputStreamReader on a FileInputStream.

    So, sounds like FileReader actually isn't the way to go. If we take the advice in the docs, then you should just change your code to have:

    String fileName = getFileNameToReadFromUserInput();
    FileInputStream is = new FileInputStream(fileName);
    InputStreamReader isr = new InputStreamReader(is, getCorrectCharsetToApply());
    BufferedReader buffReader = new BufferedReader(isr);
    

    and not try to make a FileReader at all.