Automated Analysis V7.0 no-LT

AH Formatter V7.0 is able to automatically detect a range of typographic problems in the formatted Area Tree. Solving these problems usually requires editorial or stylistic changes, and sometimes both.

Specify -analyze on the AH Formatter V7.0 Command-line Interface to enable analysis. Any problems detected are reported in the error information output. An XSLT stylesheet for annotating the Area Tree XML with visual indications of the analysis problems is available on GitHub at https://github.com/AntennaHouse/analysis-utility.

Languages, including English, each have stylistic conventions for formatted text. The origins of the conventions may be for readability, for aesthetics, for commercial reasons, or for a mix of these. Some are now just considered good design without reference to the underlying reason. Books on typography or book design will usually cover a subset of possible problems, but even the reference books differ in what they consider to be a problem, the threshold for a condition becoming a problem, and even the terminology for describing a problem.

The checks, which can be individually enabled or disabled in the Option Setting File, are:

Error messages from analyzing the Area Tree may include one or both of a message about the error and an indication of the location of the error. The general format of an analyzer error in XML format is:

<error level="1" code="45959">
Message
location:: page: 3; ax: 18; ay: 242; bx: 191.764; by: 270;
</error>

where:

Note that the error indication area does not always correspond to a single area in the Area Tree.

CAUTION: Automated Analysis is not effective with AH Formatter V7.0 Lite.

End Blank Pages

This analyzer reports when the number of blank pages exceeds the end-blank-pages-limit option setting.

The printing and binding method used for a book may require that the book is a multiple of 8, 16, 32, or even more pages. Extensions to the force-page-count property make this possible with AH Formatter V7.0. However, the forced page count may create empty pages at the end of the document just to fulfil the requirement. Empty pages are a cost to the publisher with little or no obvious benefit.

Hyphens On Consecutive Lines

This analyzer reports when the number of consecutive lines that end with a hyphen exceeds the hyphen-limit option setting.

Too many consecutive lines that end with a hyphen increase the likelihood that a reader will either skip reading a line or read the same line twice. Both the Chicago Manual of Style (17th edition) and Elements of Typographic style recommend a maximum of three consecutive lines that end with a hyphen.

AH Formatter V7.0 implements the XSL 1.1 hyphenation-ladder-count property for automatically limiting the number of successive lines that end with a hyphen. For CSS, it implements the similar hyphenate-lines from an early draft of GCPM. However, the appropriate property may not have been set, or it may be preferable to review the lines before letting AH Formatter V7.0 automatically adjust the lines.

Consecutive Lines Starting or Ending With The Same Word

This analyzer reports when the number of consecutive lines that start with the same word exceeds the line-start-repeat-limit option setting or the number of consecutive lines that end with the same word exceeds the line-end-repeat-limit option setting.

This is similar to the problem with multiple consecutive lines that end on a hyphen. Multiple consecutive lines that start with the same word or lines that end with the same word can result in a reader either skipping a line of text or rereading a line. The Chicago Manual of Style (17th edition) recommends a maximum of three lines that either start or end with the same word. Book Typography warns against multiple lines that end with the same word but does not provide a limit and does not mention lines that start with the same word.

Page Widow

This analyzer reports when a short last line of a block of text is formatted as the first line on a page or column. The line is too short when it is shorter than either the page-widow-limit-em or page-widow-limit-percent option setting.

“Widows” is name of the XSL and CSS property that controls the number of line of a paragraph that must be left at the top of a page.

Paragraph Widow

This analyzer reports when a block of text has a short last line. The line is too short when it is shorter than either the paragraph-widow-limit-em or paragraph-widow-limit-percent option setting.

“Widows” is name of the XSL and CSS property that controls the number of line of a paragraph that must be left at the top of a page.

Unbalanced Spread

This analyzer reports when the formatted length of the text in corresponding body regions on the left-hand and right-hand pages of a two-page spread are different.

A spread is not checked if either the left or right page of the spread is the last page of its page sequence.

References