Most application programs that deal with sound need to read sound files or audio streams. This is common functionality, regardless of what the program may subsequently do with the data it reads (such as play, mix, or process it). Similarly, many programs need to write sound files (or streams). In some cases, the data that has been read (or that will be written) needs to be converted to a different format.
As was briefly mentioned in Chapter 3, "Accessing Audio System Resources," the JavaTM Sound API provides application developers with various facilities for file input/output and format translations. Application programs can read, write, and translate between a variety of sound file formats and audio data formats.
Chapter 2, "Overview of the Sampled Package," introduced the main classes related to sound files and audio data formats. As a review:
AudioInputStream
object. (AudioInputStream
inherits from java.io.InputStream
.)
AudioFormat
object.
This format specifies how the audio samples themselves are arranged, but not the structure of a file that they might be stored in. In other words, an AudioFormat
describes "raw" audio data, such as the system might hand your program after capturing it from a microphone input or after parsing it from a sound file. An AudioFormat
includes such information as the encoding, the byte order, the number of channels, the sampling rate, and the number of bits per sample.
AudioFileFormat
object. The AudioFileFormat
includes an AudioFormat
object to describe the format of the audio data stored in the file, and also includes information about the file type and the length of the data in the file.
AudioSystem
class provides methods for (1) storing a stream of audio data from an AudioInputStream
into an audio file of a particular type (in other words, writing a file), (2) extracting a stream of audio bytes (an AudioInputStream
) from an audio file (in other words, reading a file), and (3) converting audio data from one data format to another. The present chapter, which is divided into three sections, explains these three kinds of activity.
AudioSystem
class supplies methods that allow application programs to learn what conversions
are available, as described later in this chapter under "Converting
File and Data Formats."
The AudioSystem
class provides two types of file-reading services:
getAudioFileFormat
method:
static AudioFileFormat getAudioFileFormat (java.io.File file)
static AudioFileFormat getAudioFileFormat(java.io.InputStream stream)
static AudioFileFormat getAudioFileFormat (java.net.URL url)
As mentioned above, the returned AudioFileFormat
object tells
you the file type, the length of the data in the file, encoding, the byte order,
the number of channels, the sampling rate, and the number of bits per sample.
The second type of file-reading functionality is given
by these AudioSystem
methods
static AudioInputStream getAudioInputStream (java.io.File file)
static AudioInputStream getAudioInputStream (java.net.URL url)
static AudioInputStream getAudioInputStream (java.io.InputStream stream)
These methods give you an object (an AudioInputStream
) that lets
you read the file's audio data, using one of the read methods of AudioInputStream
.
We'll see an example momentarily.
Suppose you're writing a sound-editing application that allows the user to load sound data from a file, display a corresponding waveform or spectrogram, edit the sound, play back the edited data, and save the result in a new file. Or perhaps your program will read the data stored in a file, apply some kind of signal processing (such as an algorithm that slows the sound down without changing its pitch), and then play the processed audio. In either case, you need to get access to the data contained in the audio file. Assuming that your program provides some means for the user to select or specify an input sound file, reading that file's audio data involves three steps:
AudioInputStream
object from the file.
Let's take a look at what's happening in the above code sample. First, the outer try clause instantiates anint totalFramesRead = 0; File fileIn = new File(somePathName); // somePathName is a pre-existing string whose value was // based on a user selection. try { AudioInputStream audioInputStream = AudioSystem.getAudioInputStream(fileIn); int bytesPerFrame = audioInputStream.getFormat().getFrameSize(); // Set an arbitrary buffer size of 1024 frames. int numBytes = 1024 * bytesPerFrame; byte[] audioBytes = new byte[numBytes]; try { int numBytesRead = 0; int numFramesRead = 0; // Try to read numBytes bytes from the file. while ((numBytesRead = audioInputStream.read(audioBytes)) != -1) { // Calculate the number of frames actually read. numFramesRead = numBytesRead / bytesPerFrame; totalFramesRead += numFramesRead; // Here, do something useful with the audio data that's // now in the audioBytes array... } } catch (Exception ex) { // Handle the error... } } catch (Exception e) { // Handle the error... }
AudioInputStream
object through the call to the AudioSystem.getAudioInputStream(File)
method. This method transparently performs all of the testing required to determine whether the specified file is actually a sound file of a type that is supported by the Java Sound API. If the file being inspected (fileIn
in this example) is not a sound file, or is a sound file of some unsupported type, an UnsupportedAudioFileException
exception is thrown. This behavior is convenient, in that the application programmer need not be bothered with testing file attributes, nor with adhering to any file-naming conventions. Instead, the getAudioInputStream
method takes care of all the low-level parsing and verification that is required to validate the input file.
The outer try
clause then creates a byte array, audioBytes
, of an arbitrary fixed length. We make sure that its length in bytes equals an integral number of frames, so that we won't end up reading only part of a frame or, even worse, only part of a sample. This byte array will serve as a buffer to temporarily hold a chunk of audio data as it's read from the stream. If we knew we would be reading nothing but very short sound files, we could make this array the same length as the data in the file, by deriving the length in bytes from the length in frames, as returned by AudioInputStream's getFrameLength
method. (Actually, we'd probably just use a Clip
object instead.) But to avoid running out of memory in the general case, we instead read the file in chunks, one buffer at a time.
The inner try
clause contains a while
loop, which is where we read the audio data from the AudioInputStream
into the byte array. You should add code in this loop to handle the audio data in this array in whatever way is appropriate for your program's needs. If you're applying some kind of signal processing to the data, you'll probably need to query the AudioInputStream's AudioFormat
further, to learn the number of bits per sample and so on.
Note that the method AudioInputStream.read(byte[])
returns the number of bytes readnot the number of samples or
frames. This method returns -1 when there's no more data to read. Upon detecting
this condition, we break from the while
loop.
The previous section described the basics of reading a sound file, using specific methods of the AudioSystem
and AudioInputStream
classes. This section describes how to write audio data out to a new file.
The following AudioSystem
method creates a disk file of a specified file type. The file will contain the audio data that's in the specified AudioInputStream
:
Note that the second argument must be one of the file types supported by the system (for example, AU, AIFF, or WAV), otherwise thestatic int write(AudioInputStream in, AudioFileFormat.Type fileType, File out)
write
method will throw an IllegalArgumentException
. To avoid this, you can test whether or not a particular AudioInputStream
may be written to a particular type of file, by invoking this AudioSystem
method:
which will returnstatic boolean isFileTypeSupported (AudioFileFormat.Type fileType, AudioInputStream stream)
true
only if the particular combination is supported.
More generally, you can learn what types of file the system can write by invoking one of these AudioSystem
methods:
The first of these returns all the types of file that the system can write, and the second returns only those that the system can write from the given audio input stream.static AudioFileFormat.Type[] getAudioFileTypes() static AudioFileFormat.Type[] getAudioFileTypes(AudioInputStream stream)
The following excerpt demonstrates one technique for creating an output file from an AudioInputStream
using the write
method mentioned above.
The first statement above, creates a newFile fileOut = new File(someNewPathName); AudioFileFormat.Type fileType = fileFormat.getType(); if (AudioSystem.isFileTypeSupported(fileType, audioInputStream)) { AudioSystem.write(audioInputStream, fileType, fileOut); }
File
object, fileOut
,
with a user- or program-specified pathname. The second statement gets a file type
from a pre-existing AudioFileFormat
object called fileFormat
,
which might have been obtained from another sound file, such as the one that was
read in the "Reading Sound Files" section of
this chapter. (You could instead supply whatever supported file type you want,
instead of getting the file type from elsewhere. For example, you might delete
the second statement and replace the other two occurrences of fileType
in the code above with AudioFileFormat.Type.WAVE
.)
The third statement tests whether a file of the designated type can be written from a desired AudioInputStream
. Like the file format, this stream might have been derived from the sound file previously read. (If so, presumably you've processed or altered its data in some way, because otherwise there are easier ways to simply copy a file.) Or perhaps the stream contains bytes that have been freshly captured from the microphone input.
Finally, the stream, file type, and output file are passed to the AudioSystem
.write
method, to accomplish the goal of writing the file.
Recall from the section "What Is Formatted Audio Data?" in Chapter 2, "Overview of the Sampled Package," that the Java Sound API distinguishes between audio file formats and audio data formats. The two are more or less independent. Roughly speaking, the data format refers to the way in which the computer represents each raw data point (sample), while the file format refers to the organization of a sound file as stored on a disk. Each sound file format has a particular structure that defines, for example, the information stored in the file's header. In some cases, the file format also includes structures that contain some form of meta-data, in addition to the actual "raw" audio samples. The remainder of this chapter examines methods of the Java Sound API that enable a variety of file-format and data-format conversions.
This section covers the fundamentals of converting audio file types in the Java Sound API. Once again we pose a hypothetical program whose purpose, this time, is to read audio data from an arbitrary input file and write it into a file whose type is AIFF. Of course, the input file must be of a type that the system is capable of reading, and the output file must be of a type that the system is capable of writing. (In this example, we assume that the system is capable of writing AIFF files.) The example program doesn't do any data format conversion. If the input file's data format can't be represented as an AIFF file, the program simply notifies the user of that problem. On the other hand, if the input sound file is an already an AIFF file, the program notifies the user that there is no need to convert it.
The following function implements the logic just described:
public void ConvertFileToAIFF(String inputPath, String outputPath) { AudioFileFormat inFileFormat; File inFile; File outFile; try { inFile = new File(inputPath); outFile = new File(outputPath); } catch (NullPointerException ex) { System.out.println("Error: one of the ConvertFileToAIFF" +" parameters is null!"); return; } try { // query file type inFileFormat = AudioSystem.getAudioFileFormat(inFile); if (inFileFormat.getType() != AudioFileFormat.Type.AIFF) { // inFile is not AIFF, so let's try to convert it. AudioInputStream inFileAIS = AudioSystem.getAudioInputStream(inFile); inFileAIS.reset(); // rewind if (AudioSystem.isFileTypeSupported( AudioFileFormat.Type.AIFF, inFileAIS)) { // inFileAIS can be converted to AIFF. // so write the AudioInputStream to the // output file. AudioSystem.write(inFileAIS, AudioFileFormat.Type.AIFF, outFile); System.out.println("Successfully made AIFF file, " + outFile.getPath() + ", from " + inFileFormat.getType() + " file, " + inFile.getPath() + "."); inFileAIS.close(); return; // All done now } else System.out.println("Warning: AIFF conversion of " + inFile.getPath() + " is not currently supported by AudioSystem."); } else System.out.println("Input file " + inFile.getPath() + " is AIFF." + " Conversion is unnecessary."); } catch (UnsupportedAudioFileException e) { System.out.println("Error: " + inFile.getPath() + " is not a supported audio file type!"); return; } catch (IOException e) { System.out.println("Error: failure attempting to read " + inFile.getPath() + "!"); return; } }
As mentioned, the purpose of this example function,
ConvertFileToAIFF
, is to query an input file to determine whether
it's an AIFF sound file, and if it isn't, to try to convert it to one, producing
a new copy whose pathname is specified by the second argument. (As an exercise,
you might try making this function more general, so that instead of always converting
to AIFF, the function converts to the file type specified by a new function
argument.) Note that the audio data format of the copythat is, the new
file-mimics the audio data format of original input file.
Most of this function is self-explanatory and is not specific to the Java Sound API. There are, however, a few Java Sound API methods used by the routine that are crucial for sound file-type conversions. These method invocations are all found in the second try
clause, above, and include the following:
AudioSystem.getAudioFileFormat
: used here to determine whether the input file is already an AIFF type. If so, the function quickly returns; otherwise the conversion attempt proceeds.
AudioSystem.isFileTypeSupported
: Indicates whether the system can write a file of the specified type that contains audio data from the specified AudioInputStream.
In our example, this method returns true
if the specified audio input file can be converted to AIFF audio file format. If AudioFileFormat.Type.AIFF
isn't supported, ConvertFileToAIFF
issues a warning that the input file can't be converted, then returns.
AudioSystem.write
: used here to write the audio data from the AudioInputStream inFileAIS
to the output file outFile
.
isFileTypeSupported
, helps to determine, in advance of the write, whether a particular input sound file can be converted to a particular output sound file type. In the next section we will see how, with a few modifications to this ConvertFileToAIFF
sample routine, we can convert the audio data format, as well as the sound file type.
The previous section showed how to use the Java Sound API to convert a file from one file format (that is, one type of sound file) to another. This section explores some of the methods that enable audio data format conversions.
In the previous section, we read data from a file of an arbitrary type, and saved it in an AIFF file. Note that although we changed the type of file used to store the data, we didn't change the format of the audio data itself. (Most common audio file types, including AIFF, can contain audio data of various formats.) So if the original file contained CD-quality audio data (16-bit sample size, 44.1-kHz sample rate, and two channels), so would our output AIFF file.
Now suppose that we want to specify the data format of the output file, as well as the file type. For example, perhaps we are saving many long files for use on the Internet, and are concerned about the amount of disk space and download time required by our files. We might choose to create smaller AIFF files that contain lower-resolution data-for example, data that has an 8-bit sample size, an 8-kHz sample rate, and a single channel.
Without going into as much coding detail as before, let's explore some of the methods used for data format conversion, and consider the modifications that we would need to make to the ConvertFileToAIFF
function to accomplish the new goal.
The principal method for audio data conversion is, once again, found in the AudioSystem
class. This method is a variant of getAudioInputStream
:
This function returns anAudioInputStream getAudioInputStream(AudioFormat format, AudioInputStream stream)
AudioInputStream
that is the result of converting the AudioInputStream
, stream
, using the indicated AudioFormat
, format
. If the conversion isn't supported by AudioSystem
, this function throws an IllegalArgumentException
.
To avoid that, we can first check whether the system can perform the required conversion by invoking this AudioSystem
method:
In this case, we'd passboolean isConversionSupported(AudioFormat targetFormat, AudioFormat sourceFormat)
stream.getFormat()
as the second argument.
To create a specific AudioFormat
object, we use one of the two AudioFormat
constructors shown below, either
which constructs anAudioFormat(float sampleRate, int sampleSizeInBits, int channels, boolean signed, boolean bigEndian)
AudioFormat
with a linear PCM encoding and the given parameters, or
which also constructs anAudioFormat(AudioFormat.Encoding encoding, float sampleRate, int sampleSizeInBits, int channels, int frameSize, float frameRate, boolean bigEndian)
AudioFormat
, but lets you specify the encoding, frame size, and frame rate, in addition to the other parameters.
Now, armed with the methods above, let's see how we might extend our ConvertFileToAIFF
function to perform the desired "low-res" audio data format conversion. First, we would construct an AudioFormat
object describing the desired output audio data format. The following statement would suffice and could be inserted near the top of the function:
Since theAudioFormat outDataFormat = new AudioFormat((float) 8000.0, (int) 8, (int) 1, true, false);
AudioFormat
constructor above is describing a format with 8-bit samples, the last parameter to the constructor, which specifies whether the samples are big or little endian, is irrelevant. (Big versus little endian is only an issue if the sample size is greater than a single byte.)
The following example shows how we would use this new AudioFormat
to convert the AudioInputStream
, inFileAIS
, that we created from the input file:
It wouldn't matter too much where we inserted this code, as long as it was after the construction ofAudioInputStream lowResAIS; if (AudioSystem.isConversionSupported(outDataFormat, inFileAIS.getFormat())) { lowResAIS = AudioSystem.getAudioInputStream (outDataFormat, inFileAIS); }
inFileAIS
. Without the isConversionSupported
test, the call would fail and throw an IllegalArgumentException
if the particular conversion being requested was unsupported. (In this case, control would transfer to the appropriate catch
clause in our function.)
So by this point in the process, we would have produced a new AudioInputStream
, resulting from the conversion of the original input file (in its AudioInputStream
form) to the desired low-resolution audio data format as defined by outDataFormat
.
The final step to produce the desired low-resolution, AIFF sound file would be to replace the AudioInputStream
parameter in the call to AudioSystem.write
(that is, the first parameter) with our converted stream, lowResAIS
, as follows:
These few modifications to our earlier function produce something that converts both the audio data and the file format of any specified input file, assuming of course that the system supports the conversion.AudioSystem.write(lowResAIS, AudioFileFormat.Type.AIFF, outFile);
Several AudioSystem
methods test their parameters to determine whether the system supports a particular data format conversion or file-writing operation. (Typically, each method is paired with another that performs the data conversion or writes the file.) One of these query methods, AudioSystem.isFileTypeSupported
, was used in our example function, ConvertFileToAIFF
, to determine whether the system was capable of writing the audio data to an AIFF file. A related AudioSystem
method, getAudioFileTypes(AudioInputStream)
, returns the complete list of supported file types for the given stream, as an array of AudioFileFormat.Type
instances. The method:
is used to determine whether an audio input stream of the specified encoding can be obtained from an audio input stream that has the specified audio format. Similarly, the method:boolean isConversionSupported(AudioFormat.Encoding encoding,
AudioFormat format)
boolean isConversionSupported(AudioFormat newFormat, AudioFormat oldFormat)
tells us whether an AudioInputStream
with the specified audio format, newFormat
, can be obtained through the conversion of an AudioInputStream
that has the audio format oldFormat
. (This method was invoked in the previous section's code excerpt that created a low-resolution audio input stream, lowResAIS
.)
These format-related queries help prevent errors when attempting to perform format conversions with the Java Sound API.