If you’re an editor or proofreader who routinely works on mathematics or science material, you’ll be used to dealing with figures, percentages, tables and graphs. But even editors who work on other texts (academic, business, and even fiction) will sometimes need to handle numbers and data.
Perhaps you’re editing a survey report, a paper containing the results of a research study, or an organisation’s annual report. Even if the client isn’t expecting you to perform an in-depth analysis of their calculations and data presentation (and you feel it’s out of your area of expertise – or your comfort zone), there are some straightforward things you can look out for to help your author keep their data in line.
Most of what follows is based on my own experience, so it isn’t intended to be an exhaustive list of issues. But I hope it will reassure you that checking figures and data presentation doesn’t necessarily require you to have a PhD in mathematics. Your basic editing and proofreading skills – together with a bit of logic and common sense – can often help you to spot when something’s amiss.
1. Words or figures in the text?
In some (though not all) conventions, numbers within the same sentence should be made consistent:
- I bought 3 apples and 13 pears. YES
- I bought three apples and 13 pears. NO
In formal writing, it’s preferable to avoid starting a sentence with a figure. This might simply mean using words instead:
- Twenty people visited the museum. YES
- 20 people visited the museum. NO
Alternatively – and particularly if the number is a large one, or it isn’t a whole number – it’s better to reword the sentence:
- On average, 345.23 people visited the museum each day. YES
- 345.23 people on average visited the museum each day. NO
If you’re editing fiction, conventions might be slightly different. For example, numbers are often expressed in words when they appear in dialogue.
2. Talking about numbers in the text
- According to our research, 50% of dentists are women. YES
- According to our research, 50% of women are dentists. NO
It’s easy to see how the confusion arises: the sentences contain the same words – just in a slightly different order. In some cases – like this one – it’s obvious that one of these sentences is incorrect, purely on the basis of general knowledge. In other cases, such inaccuracies are more difficult to spot without checking elsewhere in the document.
Other things to look out for in the text include:
- Making sure ‘greater than’ (>) and ‘less than’ (<) are the right way round (sometimes mix-ups occur when the author has rearranged the text);
- Making sure ‘significant’ and ‘not significant’ are correctly attributed (particularly relevant when statistical analysis is being reported).
3. Consistency and common sense
Just as you’d use your own knowledge to spot errors when proofreading a general piece of text, you can sometimes see clearly that a mistake has been made with a number. If an author claims that the population of London is 8,787 or 8.7 billion, you’ll probably realise that there’s something wrong. In a school report I was proofreading, I noticed that a student was congratulated on achieving 1.25 cm in the high jump, which conjured up an amusing – though misleading – image.
4. A note about averages
5. Problems with percentages
As well as basic calculation mistakes, it’s worth being on the lookout for inaccurate descriptions of percentages. I’ve seen ‘majority’ used to mean ‘the largest proportion’. Here’s an example:
- 25% of people in the group are self-employed
- 35% of people are retired
- 40% of people are employed
This does not mean that ‘the majority of people are employed’ – ‘majority’ means ‘most’ (i.e. more than 50%), rather than simply the largest group.
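If you'd like to test a 'majority' claim mechanically, here is a minimal Python sketch using the three figures above. The function name `describe_largest_group` is my own invention for illustration; the only rule it applies is the one just stated, that a majority means more than 50%.

```python
def describe_largest_group(shares):
    """Return (label, share, is_majority) for the largest group.

    `shares` maps a group label to its percentage of the whole.
    A group counts as a majority only if it exceeds 50%.
    """
    label = max(shares, key=shares.get)
    share = shares[label]
    return label, share, share > 50


groups = {"self-employed": 25, "retired": 35, "employed": 40}
print(describe_largest_group(groups))  # ('employed', 40, False)
```

Here 'employed' is the largest group, but the final `False` shows that it is not a majority.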
Another thing to check is whether the percentages add up to 100. But beware – that might not always be appropriate. For example, if reporting on answers to a survey question where people could tick more than one option, the total could well be more than 100%. In this example, it’s clear that some people like both apples and bananas:
- 65% of people like apples
- 73% of people like bananas
In cases like this, the percentages won’t necessarily add up to 100.
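The two checks described here can be sketched in a few lines of Python. This is only an illustration: the function names, the `multiple_answers` flag and the rounding `tolerance` of 0.5 are all assumptions of mine, not a standard. The second function shows why the apples-and-bananas figures prove that some people like both: the two shares exceed 100 by 38, so at least 38% must be in both groups.

```python
def check_percentage_total(percentages, multiple_answers=False, tolerance=0.5):
    """Say whether a set of percentages needs querying.

    `tolerance` allows for rounding in the reported figures; 0.5 is an
    arbitrary choice for this sketch.
    """
    total = sum(percentages)
    if multiple_answers:
        # Respondents could tick several options, so no fixed total applies.
        return "no fixed total expected"
    if abs(total - 100) <= tolerance:
        return "ok"
    return "query with author"


def minimum_overlap(share_a, share_b):
    """Smallest percentage that must belong to both groups."""
    return max(0, share_a + share_b - 100)


print(check_percentage_total([25, 35, 40]))                     # ok
print(check_percentage_total([65, 73], multiple_answers=True))  # no fixed total expected
print(minimum_overlap(65, 73))                                  # 38
```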
A more subtle issue when it comes to percentages is whether the author really does mean ‘per cent’ (%), or whether they mean ‘percentage points’. As an example, if the unemployment rate in 2001 was 5% and the unemployment rate in 2011 was 10%, the correct way to describe this would be to say that the 2011 rate is 5 percentage points higher than the 2001 rate. The unemployment rate is definitely not 5% higher (in fact, it is 100% higher!).
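The difference between the two measures is easy to see when the arithmetic is written out. The following Python sketch uses the unemployment figures from the example; the function names are simply labels I've chosen for illustration.

```python
def percentage_point_change(old_rate, new_rate):
    """Simple difference between two rates, in percentage points."""
    return new_rate - old_rate


def relative_change_percent(old_rate, new_rate):
    """Change expressed as a percentage of the old rate."""
    return (new_rate - old_rate) / old_rate * 100


print(percentage_point_change(5, 10))  # 5
print(relative_change_percent(5, 10))  # 100.0
```

So a rise from 5% to 10% is 5 percentage points, or 100 per cent of the original rate.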
6. Calculation check
You can also use Excel to perform other straightforward calculations (%, −, ×, ÷). Even if you don’t routinely use Excel, it’s worth familiarising yourself with the basic functions and with how to create a simple formula. Of course, you can always check figures using pen and paper or a calculator, but Excel can save you quite a bit of time (and – if used with care – can reduce the risk that you’ll make errors in your own calculations).
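If you prefer a script to a spreadsheet, the same kind of check can be sketched in Python. This is a rough illustration rather than a recommended workflow: the function names and the figures are invented for the example, and the percentage check assumes the author has rounded in the usual way to the stated number of decimal places.

```python
def total_matches(values, stated_total):
    """True if the values add up to the total given in the document."""
    return sum(values) == stated_total


def percentage_matches(part, whole, stated_percent, decimals=0):
    """True if part/whole, as a percentage, rounds to the stated figure."""
    actual = part / whole * 100
    tolerance = 0.5 * 10 ** -decimals
    return abs(actual - stated_percent) <= tolerance


# Hypothetical figures: 'of the 80 respondents, 23 (29%) said yes'.
print(total_matches([23, 40, 17], 80))  # True
print(percentage_matches(23, 80, 29))   # True
print(percentage_matches(23, 80, 32))   # False
```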
7. Equations
8. Number ranges
- 0–10, 10–20, 20–30, … and so on.
This is a problem because the ranges don’t have clear upper and lower limits: they overlap. For example, in which category would a value of ‘20’ be placed? Only the originator of the data would know the answer, so there’s usually very little that the editor or proofreader can do to correct this, other than query it. Ideally, ranges should look like this:
- 0–9, 10–19, … and so on.
Or like this:
- 0–10, 11–20, etc.
And if the data includes values that are not whole numbers (e.g. 2.8, 19.99, etc.), the ranges will need to be more exact:
- 0–9.99, 10.00–19.99, etc.
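Overlaps and gaps in a set of ranges can be spotted mechanically. Here is a minimal Python sketch, assuming the ranges are listed in ascending order; the function name `check_ranges` and the `step` parameter (the smallest increment the data can take, e.g. 1 for whole numbers or 0.01 for two decimal places) are my own assumptions for illustration.

```python
def check_ranges(ranges, step=1):
    """List overlaps and gaps between consecutive (lower, upper) ranges.

    `ranges` must be sorted in ascending order. `step` is the smallest
    increment the data can take; a small allowance is added so that
    decimal rounding in the computer's arithmetic is not reported as a gap.
    """
    problems = []
    for (lo1, hi1), (lo2, hi2) in zip(ranges, ranges[1:]):
        if lo2 <= hi1:
            problems.append(f"overlap: {lo1}-{hi1} and {lo2}-{hi2}")
        elif lo2 - hi1 > step + 1e-9:
            problems.append(f"gap: {hi1} to {lo2}")
    return problems


print(check_ranges([(0, 10), (10, 20), (20, 30)]))
# ['overlap: 0-10 and 10-20', 'overlap: 10-20 and 20-30']
print(check_ranges([(0, 9), (10, 19)]))             # []
print(check_ranges([(0, 9), (10, 19)], step=0.01))  # ['gap: 9 to 10']
```

The last line shows why whole-number ranges are not enough for data with decimals: a value such as 9.5 would fall between 0–9 and 10–19.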
9. Units, decimal separators and thousands separators
a) Units
- Are these consistently abbreviated (km) or spelled out (kilometres)?
- If they’re abbreviated, are they closed up to the figure (5km)? If there’s a space, should this be a non-breaking space so that the figure and the unit do not become separated over a line break?
- How should percentages be expressed: 25%, 25 % (with a non-breaking space), 25 percent (tends to be US spelling) or 25 per cent (tends to be UK spelling)?
b) Decimal separators
In UK and US English, a full stop (full point) is usually used to separate whole numbers from decimals. The decimal point is usually on the baseline of the text, but sometimes a middle dot is used (e.g. 34·12). And in some countries a comma is used as a decimal separator. I’ve come across this most often in work by continental European authors, but the convention is also followed elsewhere.
c) Thousands separators
Similarly, there are different conventions for separating groups of digits in larger numbers. In many cases a comma is used (10,000), but some styles call for a non-breaking space (10 000).
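To see why it matters which convention an author is following, consider how the same string of characters can stand for two very different values. The Python sketch below is illustrative only: `parse_number` is a hypothetical helper of my own, and it assumes you already know which separators the author intended.

```python
def parse_number(text, decimal_sep=".", thousands_sep=","):
    """Convert a number written with the given separators to a float."""
    # Treat non-breaking and narrow non-breaking spaces as ordinary spaces.
    cleaned = text.replace("\u00a0", " ").replace("\u202f", " ")
    if thousands_sep:
        cleaned = cleaned.replace(thousands_sep, "")
    cleaned = cleaned.replace(decimal_sep, ".")
    return float(cleaned)


print(parse_number("10,000"))                                    # 10000.0
print(parse_number("10\u00a0000", thousands_sep=" "))            # 10000.0
print(parse_number("1,234"))                                     # 1234.0
print(parse_number("1,234", decimal_sep=",", thousands_sep="."))  # 1.234
```

The last two lines read the same characters, '1,234', as one thousand two hundred and thirty-four and as just over one, which is why an unexplained mixture of conventions is worth querying.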
10. Checking charts
a) A suitable style
Does the chart present the data clearly and unambiguously? For example, pie charts are often not a good way of presenting data, as this article explains. When checking a chart, graph or diagram, you need to ask whether it makes sense. Can you think of a clearer or better way of presenting the data?
b) Gimmicks
It’s tempting to use colours and special effects to make a chart, graph or diagram more eye-catching and ‘interesting’. However, that’s often not necessary (unless such effects are part of house style or branding). Bear in mind that colours, shapes and fancy shading can be distracting and confusing for the reader. In any case, such effects will often be stripped out at the next stage of the publication process.
c) Axes
As a rule, these should always start at zero. In this example, the chart on the right seems to be suggesting that The Times newspaper has twice as many sales as the Daily Telegraph, but that’s simply because the vertical axis starts at 420,000 rather than at zero. Results can easily be distorted if the chart isn’t showing the full picture.
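The size of this distortion can be worked out with simple arithmetic. In the Python sketch below, the axis starting point of 420,000 comes from the example, but the two sales figures are hypothetical values I've invented to show the effect; they are not the newspapers' actual sales.

```python
def apparent_ratio(value_a, value_b, axis_start=0):
    """Ratio of the bar heights as drawn when the axis starts at `axis_start`."""
    return (value_a - axis_start) / (value_b - axis_start)


# Hypothetical sales figures, for illustration only.
paper_a, paper_b = 460_000, 440_000

print(apparent_ratio(paper_a, paper_b, axis_start=420_000))  # 2.0
print(round(apparent_ratio(paper_a, paper_b), 2))            # 1.05
```

With the axis starting at 420,000, the first bar is drawn twice as tall as the second, even though the underlying figure is only about 5% higher.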
d) When charts go wrong
Special mention must be made of this chart, which shows the average female height in various countries. It’s misleading in a couple of important ways:
- The shapes chosen to represent the different heights are of different widths (in proportion to their heights). So the ‘Latvia’ shape is larger in all dimensions than the ‘India’ shape, completely distorting the picture.
- The vertical axis does not start at zero. Yes, the values represented are all above 5 feet, which is probably why the authors decided to present the data like this. But that has the effect of suggesting that women in Latvia are four times as tall as women in India.
11. Tables
When it comes to checking the details within a table, the advice I’ve already given still applies. Use your proofreading skills to check whether figures are consistent with those mentioned in the text. Add up columns to check totals. Apply your common sense to make sure the data looks correct.
It’s also worth checking the following specific points:
a) Units
Are the units clearly stated? For a table displaying only one type of data (e.g. percentages or monetary values), the units are sometimes included in the table caption:
Table 1: Owner-occupiers as a proportion of the population, 1950–2017 (%)
If the table shows different types of information, the units might be included with each value. So the columns will look something like this:
b) Order of items
Check whether items are listed in a consistent way. For example, in the table above, the ‘countries’ are shown in alphabetical order. But the author could also have chosen to list them in ascending or descending order, by either the unemployment rate or the average income. Either of these alternatives would have been acceptable. However, if items are listed in apparently random or inconsistent order (e.g. in a series of tables), this might be confusing to the reader.
c) Row and column headings
Do these clearly explain the data in the table? If there’s more than one table with a similar layout, are the row and column headings presented in a consistent way?
d) Table layout
I recently came across a table similar to this one (I’ve only included part of the table, although the rest of it was similarly misleading). The author had tried to present information about the sample population, but the table was laid out in such a way that it implied an association between different variables. For example, it appeared that all the male participants were in the three younger age groups, and that all the female participants were in the older age groups.
Rather than presenting the information in this way, it would have been better to keep all the categories separate. This is the layout that I suggested to the author.
- Consult the style guide (if any) to check what’s required.
- Consult Butcher’s Copy-editing and New Hart’s Rules, both of which have useful sections on science and mathematics.
- Use logic and common sense to judge when something’s not quite right.
- Use Excel to check straightforward calculations.
- If numbers (and the way they are presented) are repeated in different parts of the document, check them for consistency.
- Check for similar statements about numerical data in different parts of the document (in the abstract, executive summary and conclusion, for example) and make sure they’re consistent.
- Ask yourself whether charts and tables present the data in a clear way.
- Query with the author if anything looks amiss.