Skip to main content

https://accessibility.blog.gov.uk/2023/06/12/making-a-positive-change-pdf-to-html/

Making a positive change: PDF to HTML

Over 100 participants attended our live session on digital accessibility last November. As communicators, we must take care to make our content as easy to understand and accessible as possible.

In order to promote designing accessibility into the system and making people care about accessibility, let’s continue the discussion.

PDF and climate change

Perhaps the most surprising outcome from the panel was about Carbon Dioxide emission in relation to the use of PDF as a communication medium.

The environmental case for HTML over PDF was drawn from research and discussion between Government Digital Service (GDS) and Welsh Government content strategists. In the examples presented during the panel session, Pingdom was used to measure the size of a particular HTML page and website carbon calculator was used to estimate the amount of CO2 emitted for that page.

It’s worth noting that other sources use a different means of measuring the effective size of a HTML page and estimating the amount of CO2 emitted per view.

What is evident across all methods of analysis however is that larger file size = more data transferred = more CO2 emitted. This is particularly evident when PDFs use large amounts of visual content, such as decorative graphics and high-resolution images or diagrams.

Consider inclusion beyond disability. Light, mobile-friendly web pages are critical for users who experience data poverty or have intermittent connections in remote locations. Loading speed has a major impact on the user experience of any online activity, including government services and information. The following image shows the size of a document in HTML and PDF formats.

Screenshot of the gov.uk webpage about the COVID-19 response: Living with COVID-19 the document has been published in both PDF and HTML format.

The HTML page shown is significantly smaller at 1.4 MB than the PDF at nearly 2 MB – 42% larger – despite both containing the same information. The PDF emits an estimated 0.561g of CO2e every time someone views it versus 0.395g for the HTML version.

Read: Publishing PDFs and other files on GOV.WALES.

Legal documents and PDF

For legal documents, you will have to check with the subject matter experts and or lawyers, the exact requirements. You may gain agreement that publication does not require a PDF and that you can publish the information as a web page.

This can be more difficult if a stamp or signature needs to be displayed. In that case a PDF might be unavoidable.

Emphasise user needs in your discussion. What do you expect people to do? Download, sign and return the PDF or just read it. If the goal is just to provide information than a HTML page is ideal.

What recommendations do you have for setting up a long publication so that it will work in HTML format (for example an annual report)?

Plan the images, graphs and tables and layout from the start and work with your design team/agency to ensure all assets are accessible.

Discuss options for landscape images, font size for diagrams and alternative and additional options for CSV file of data tables.

Consider breaking down a long publication into chapters and publish one chapter per web page. This can make larger online publications easier to navigate. It is important to make sure that your website can support long publications in HTML, you might need to build new templates to ensure the best experience for your users.

An alternative way to ensure users get the information they need is by publishing a Word (.DOCX) file instead of a PDF. Users can open these in their preferred word processing software and change the document formatting to better suit them.

Such changes include:

  • increasing text size
  • changing the font
  • changing the text or background colours
  • increasing word or line spacing

Word files can also be easily ported to Braille readers.

What research proves HTML is better than PDF?

The Government Digital Service (GDS) states Compared with HTML content, information published in a PDF is harder to find, use and maintain”.

Read their blog Why GOV.UK content should be published in HTML and not PDF to find out about the other problems with PDFs.

 

Is there still a place for PDF?

The answer is it depends.

GDS’ blog explains why they remain popular in government. One example includes control over the layout and design.

It all comes back to the user’s needs:

  • Do they have time to download the PDF?
  • Will they do it on their phone or tablet?
  • Will the layout help communicate information?

PDFs can be unwieldy, slow to download and costly to update.

Consider the needs of the people you are publishing the information for. Engage with them early to explore alternative options that may better meet their needs.

Dos and don’ts on designing for accessibility.

Continue your accessibility journey

We suggest you keep asking about the users’ needs at the planning stage. Ask if people have thought about a web page as an asset, instead of a PDF. Explain the advantages of presenting information in HTML. Making your communication, your content and your message more accessible means everyone will benefit.

Use our guidance on Accessible communications

Watch our webinars:

Explore tutorials on how to make something HTML and how to accessibility check a PDF

HTML

PDF

With thanks to our panellists for contributing to this blog post:

  • Stephanie Hill, Digital Content Lead, UK Health Security Agency
  • Samantha Merrett, the Accessibility Lead at the Food Standards Agency
  • Chris Comber, Graphic Designer for GRS/GBS in the Cabinet Office
  • William Rees, Senior Content Designer and accessibility specialist at the Welsh Government

Sharing and comments

Share this page

4 comments

  1. Comment by Caroline Gill posted on

    This is very interesting. Thank you.

    Are you aware of a tool for turning PDFs into html that a local council could use? The one GDS uses produces a great result, but I don't believe councils can use that.

    Thank you.

  2. Comment by Chad Gowler posted on

    Given the recommendation that users are provided a docx file, do you have the stats for their file size/CO2 output vs the HTML page?

    I’d also be interested if something like including print styles and functionality was considered as well?

    • Replies to Chad Gowler>

      Comment by samanthamerrett posted on

      Hi Chad,

      Thanks for your message.

      The report authors have provided the following response:

      PDF and DOCX filetypes are larger in size than the same content properly rendered as a responsive HTML page. Both contain proprietary code used to calculate positioning and font sizes, and render styles and other rich content.

      We recommend publishing information as HTML by default, due to file size savings, analytics and SEO purposes, and close to universal access of the format.

      Where that's not possible or you need to publish information in an alternative format alongside the HTML page, .DOCX files offer users more options to customise the content than PDFs, making it easier to digest for their needs. It's also easier to make a .DOCX file work well with assistive technologies than a PDF.

      While this represents a trade-off between file size and functionality, it's mitigated by the fact that the number of users who need to download the alternative format document would be low compared to those who view the same information in HTML.

      We hope this helps to answer your query, thank you.