Most content doesn't just include text — it also has images, diagrams, metadata, and sometimes markup like XML or HTML. But Acrolinx only checks the text.
To get text out of your content, Acrolinx uses text extraction. For more information on how this works, learn how to create or configure a Content Profile.
The following diagram shows you how Acrolinx reads content: