PDF documents are binary files, which makes them much more complex than plaintext files.
In addition to text, they store font, color and layout information.
PyPDF2 does not have a way to extract images, charts or other media from PDF documents, but it can extract text and return it as a Python string.
Github link for the code : https://github.com/BekBrace/pdf-to-te...
DEV profile : https://dev.to/bekbrace
Github profile: https://github.com/BekBrace