A comprehensive reference for audiobook narrators, studios, and publishers.
The technical audio specifications that Audible/ACX requires for all audiobook submissions, covering sample rate, bit depth, loudness, noise floor, peak level, and file format.
A voice actor who performs and records the spoken audio for audiobooks, managing the entire production process from manuscript preparation through final audio delivery.
Metadata embedded in audiobook files that divide the audio into navigable chapters, allowing listeners to skip between sections on their playback device.
A smooth audio transition where the outgoing audio fades out while the incoming audio fades in, used at punch-in points to create seamless, click-free edits.
Software used to record, edit, and produce audio, ranging from general-purpose tools like Pro Tools and Reaper to purpose-built applications like Punch Track.
A lossless audio compression format that reduces file size without sacrificing any audio quality, making it ideal for recording sessions where quality must be preserved.
The final audio processing stage before delivery, where recorded chapters are adjusted for loudness, peak levels, noise floor, and format to meet distributor specifications.
Unwanted clicking or sticky sounds caused by saliva and tongue movement during recording, one of the most common quality issues in audiobook narration.
The level of background noise present in a recording when no one is speaking, with ACX requiring -60dB or lower for audiobook submissions.
A recording method where the narrator records straight through the entire chapter without stopping, planning to edit out mistakes in post-production.
The standard payment unit in audiobook production, representing the rate paid for each hour of final, mastered audio delivered.
A section of an audiobook chapter that a reviewer or proofer has flagged for re-recording, typically due to mispronunciation, wrong emphasis, or a missed line.
A person who listens through recorded audiobook chapters to identify errors, mark pickups, and verify that the narration matches the manuscript accurately.
A recording technique where the narrator listens back to the last few seconds of audio before seamlessly re-recording from the point of a mistake, producing a clean, edit-free take in real time.
Root Mean Square level measures the average perceived loudness of an audio signal, with ACX requiring audiobook chapters to fall between -23dB and -18dB RMS.
The natural ambient sound of a recording space captured with no one speaking, used as a reference for noise floor measurement and to fill gaps in edited audio.
The number of times per second an audio signal is measured during digital recording, with 44.1 kHz being the industry standard for audiobook production.
Harsh, piercing “s” and “sh” sounds in recorded speech caused by high-frequency energy concentration, often managed through microphone technique and de-essing.
A verbal identification spoken at the beginning of a recording take, typically including the chapter number, page, and take number to help organize audio during editing.