{"id":581,"date":"2021-03-17T15:18:00","date_gmt":"2021-03-17T12:18:00","guid":{"rendered":"http:\/\/is19-2017.susu.ru\/matveev\/?p=581"},"modified":"2021-03-17T16:26:03","modified_gmt":"2021-03-17T13:26:03","slug":"rabota-s-pdf-fajlami-na-python","status":"publish","type":"post","link":"https:\/\/is19-2017.susu.ru\/matveev\/2021\/03\/17\/rabota-s-pdf-fajlami-na-python\/","title":{"rendered":"\u0420\u0430\u0431\u043e\u0442\u0430 \u0441 PDF-\u0444\u0430\u0439\u043b\u0430\u043c\u0438 \u043d\u0430 Python"},"content":{"rendered":"<p align=\"justify\">\n\u0421\u0441\u044b\u043b\u043a\u0430 \u043d\u0430 <a href=\"https:\/\/github.com\/Ewoqi1290u3\/pdf_with_python\/\">GitHub<\/a> \u0441 \u0438\u0441\u0445\u043e\u0434\u043d\u044b\u043c\u0438 \u0434\u0430\u043d\u043d\u044b\u043c\u0438 \u0438 \u0433\u043e\u0442\u043e\u0432\u044b\u043c \u043f\u0440\u043e\u0435\u043a\u0442\u043e\u043c\n<\/p>\n<p align=\"justify\">\n1. \u041d\u0435\u043e\u0431\u0445\u043e\u0434\u0438\u043c\u043e \u0443\u0441\u0442\u0430\u043d\u043e\u0432\u0438\u0442\u044c <a href=\"https:\/\/www.python.org\/downloads\/\">python<\/a>\n<\/p>\n<p align=\"justify\">\n2. \u0414\u043b\u044f \u0440\u0430\u0431\u043e\u0442\u044b \u0441 pdf-\u0444\u0430\u0439\u043b\u0430\u043c\u0438 \u043d\u0430\u043c \u043f\u043e\u043d\u0430\u0434\u043e\u0431\u0438\u0442\u0441\u044f \u0443\u0441\u0442\u0430\u043d\u043e\u0432\u0438\u0442\u044c \u0441\u043b\u0435\u0434\u0443\u044e\u0449\u0438\u0435 \u043f\u0430\u043a\u0435\u0442\u044b \u0447\u0435\u0440\u0435\u0437 \u043a\u043e\u043c\u0430\u043d\u0434\u043d\u0443\u044e \u0441\u0442\u0440\u043e\u043a\u0443 (Windows):<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\npip3 install pypdf2\r\npip3 install pymupdf\r\npip3 install pdfrw\r\n<\/pre>\n<\/p>\n<p align=\"justify\">\n3. \u0414\u0430\u043b\u0435\u0435 \u0441\u043e\u0437\u0434\u0430\u0435\u043c \u0431\u0438\u0431\u043b\u0438\u043e\u0442\u0435\u043a\u0443 \u0434\u043b\u044f \u0443\u0434\u043e\u0431\u0441\u0442\u0432\u0430 \u0445\u0440\u0430\u043d\u0435\u043d\u0438\u0435 \u0440\u0435\u0437\u0443\u043b\u044c\u0442\u0430\u0442\u043e\u0432 \u0440\u0430\u0431\u043e\u0442\u044b<br \/>\n<img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/1-1.png\" alt=\"\" width=\"780\" height=\"192\" class=\"aligncenter size-full wp-image-585\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/1-1.png 780w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/1-1-300x74.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/1-1-768x189.png 768w\" sizes=\"(max-width: 780px) 100vw, 780px\" \/>\n<\/p>\n<h1 align=\"center\">\n<strong>\u0427\u0442\u0435\u043d\u0438\u0435 \u0438 \u0440\u0430\u0437\u0431\u043e\u0440<\/strong><br \/>\n<\/h1>\n<p align=\"justify\">\n4. \u0418\u0437\u0432\u043b\u0435\u0447\u0435\u043d\u0438\u0435 \u0442\u0435\u043a\u0441\u0442\u0430 \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyPDF2\n<\/p>\n<p align=\"justify\">\n\u0421\u043e\u0437\u0434\u0430\u0434\u0438\u043c \u0438 \u0437\u0430\u043f\u0443\u0441\u0442\u0438\u043c \u0441\u043b\u0435\u0434\u0443\u044e\u0449\u0438\u0439 \u0441\u043a\u0440\u0438\u043f\u0442:<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\nfrom PyPDF2 import PdfFileReader\r\n\r\npdf_document = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\nwith open(pdf_document, &quot;rb&quot;) as filehandle:  \r\n    pdf = PdfFileReader(filehandle)\r\n   \r\n    info = pdf.getDocumentInfo()\r\n    pages = pdf.getNumPages()\r\n    print(&quot;\u041a\u043e\u043b\u0438\u0447\u0435\u0441\u0442\u0432\u043e \u0441\u0442\u0440\u0430\u043d\u0438\u0446 \u0432 \u0434\u043e\u043a\u0443\u043c\u0435\u043d\u0442\u0435: %i\\n\\n&quot; % pages)\r\n    print(&quot;\u041c\u0435\u0442\u0430-\u043e\u043f\u0438\u0441\u0430\u043d\u0438\u0435: &quot;, info)\r\n\r\n    for i in range(pages):\r\n        page = pdf.getPage(i)\r\n        print(&quot;\u0421\u0442\u0440.&quot;, i, &quot; \u043c\u0435\u0442\u0430: &quot;, page, &quot;\\n\\n\u0421\u043e\u0434\u0435\u0440\u0436\u0430\u043d\u0438\u0435;\\n&quot;)\r\n        print(page.extractText())\r\n<\/pre>\n<p>\u0412 \u0440\u0435\u0437\u0443\u043b\u044c\u0442\u0430\u0442\u0435 \u043f\u043e\u043b\u0443\u0447\u0430\u0435\u043c \u0441\u043b\u0435\u0434\u0443\u044e\u0449\u0435\u0435:<br \/>\n<img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/2.png\" alt=\"\" width=\"669\" height=\"714\" class=\"aligncenter size-full wp-image-592\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/2.png 669w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/2-281x300.png 281w\" sizes=\"(max-width: 669px) 100vw, 669px\" \/>\n<\/p>\n<p align=\"justify\">\n5. \u0418\u0437\u0432\u043b\u0435\u0447\u0435\u043d\u0438\u0435 \u0442\u0435\u043a\u0441\u0442\u0430 \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyMuPDF<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\nimport fitz\r\n\r\npdf_document = &quot;.\/source\/Computer-Vision-Resources.pdf&quot;\r\ndoc = fitz.open(pdf_document)\r\nprint(&quot;\u0418\u0441\u0445\u043e\u0434\u043d\u044b\u0439 \u0434\u043e\u043a\u0443\u043c\u0435\u043d\u0442: &quot;, doc)\r\nprint(&quot;\\n\u041a\u043e\u043b\u0438\u0447\u0435\u0441\u0442\u0432\u043e \u0441\u0442\u0440\u0430\u043d\u0438\u0446: %i\\n\\n------------------\\n\\n&quot; % doc.pageCount)\r\nprint(doc.metadata)\r\n\r\nfor current_page in range(len(doc)):\r\n    page = doc.loadPage(current_page)\r\n    page_text = page.getText(&quot;text&quot;)\r\n    print(&quot;\u0421\u0442\u0440. &quot;, current_page+1, &quot;\\n\\n\u0421\u043e\u0434\u0435\u0440\u0436\u0430\u043d\u0438\u0435;\\n&quot;)\r\n    print(page_text)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/3.png\" alt=\"\" width=\"671\" height=\"717\" class=\"aligncenter size-full wp-image-596\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/3.png 671w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/3-281x300.png 281w\" sizes=\"(max-width: 671px) 100vw, 671px\" \/>\n<\/p>\n<p align=\"justify\">\n6. \u0418\u0437\u0432\u043b\u0435\u0447\u0435\u043d\u0438\u0435 \u0438\u0437\u043e\u0431\u0440\u0430\u0436\u0435\u043d\u0438\u0439 \u0438\u0437 PDF \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyMuPDF<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\nimport fitz\r\n\r\npdf_document = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\ndoc = fitz.open(pdf_document)\r\n\r\nprint(&quot;\u0418\u0441\u0445\u043e\u0434\u043d\u044b\u0439 \u0434\u043e\u043a\u0443\u043c\u0435\u043d\u0442&quot;, doc)\r\nprint(&quot;\\n\u041a\u043e\u043b\u0438\u0447\u0435\u0441\u0442\u0432\u043e \u0441\u0442\u0440\u0430\u043d\u0438\u0446: %i\\n\\n------------------\\n\\n&quot; % doc.pageCount)\r\nprint(doc.metadata)\r\n\r\npage_count = 0\r\nfor i in range(len(doc)):\r\n    for img in doc.getPageImageList(i):\r\n        xref = img[0]\r\n        pix = fitz.Pixmap(doc, xref)\r\n        pix1 = fitz.Pixmap(fitz.csRGB, pix)\r\n\r\n        page_count += 1\r\n        pix1.writePNG(&quot;images\/picture_number_%s_from_page_%s.png&quot; % (page_count, i+1))\r\n        print(&quot;Image number &quot;, page_count, &quot; writed...&quot;)\r\n        pix1 = None\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/4.png\" alt=\"\" width=\"671\" height=\"425\" class=\"aligncenter size-full wp-image-598\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/4.png 671w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/4-300x190.png 300w\" sizes=\"(max-width: 671px) 100vw, 671px\" \/><\/p>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/5.png\" alt=\"\" width=\"695\" height=\"226\" class=\"aligncenter size-large wp-image-599\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/5.png 869w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/5-300x98.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/5-768x250.png 768w\" sizes=\"(max-width: 695px) 100vw, 695px\" \/>\n<\/p>\n<p align=\"justify\">\n7. \u0420\u0430\u0437\u0434\u0435\u043b\u0435\u043d\u0438\u0435 PDF\u2011\u0444\u0430\u0439\u043b\u043e\u0432 \u043d\u0430 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u044b \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyPDF2<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\nfrom PyPDF2 import PdfFileReader, PdfFileWriter\r\n\r\npdf_document = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\npdf = PdfFileReader(pdf_document)\r\n\r\nfor page in range(pdf.getNumPages()):  \r\n    pdf_writer = PdfFileWriter()\r\n    current_page = pdf.getPage(page)\r\n    pdf_writer.addPage(current_page)\r\n\r\n    outputFilename = &quot;dist\/Computer-Vision-Resources-page-{}.pdf&quot;.format(page + 1)\r\n    with open(outputFilename, &quot;wb&quot;) as out:\r\n        pdf_writer.write(out)\r\n\r\n        print(&quot;created&quot;, outputFilename)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/6.png\" alt=\"\" width=\"669\" height=\"266\" class=\"aligncenter size-full wp-image-601\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/6.png 669w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/6-300x119.png 300w\" sizes=\"(max-width: 669px) 100vw, 669px\" \/><\/p>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/7.png\" alt=\"\" width=\"695\" height=\"315\" class=\"aligncenter size-large wp-image-602\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/7.png 807w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/7-300x136.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/7-768x348.png 768w\" sizes=\"(max-width: 695px) 100vw, 695px\" \/>\n<\/p>\n<p align=\"justify\">\n8. \u041f\u043e\u0438\u0441\u043a \u0441\u0442\u0440\u0430\u043d\u0438\u0446 \u043d\u0430 \u043d\u0430\u043b\u0438\u0447\u0438\u0435 \u0437\u0430\u0434\u0430\u043d\u043d\u043e\u0433\u043e \u0442\u0435\u043a\u0441\u0442\u0430<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\nimport fitz\r\n\r\nfilename = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\n\r\nsearch_term = &quot;COMPUTER VISION&quot;  \r\npdf_document = fitz.open(filename)\r\n\r\nfor current_page in range(len(pdf_document)):  \r\n    page = pdf_document.loadPage(current_page)\r\n    if page.searchFor(search_term):\r\n        print(&quot;%s \u043d\u0430\u0439\u0434\u0435\u043d\u043e \u043d\u0430 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u0435 %i&quot; % (search_term, current_page+1))\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/8.png\" alt=\"\" width=\"672\" height=\"263\" class=\"aligncenter size-full wp-image-605\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/8.png 672w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/8-300x117.png 300w\" sizes=\"(max-width: 672px) 100vw, 672px\" \/>\n<\/p>\n<h1 align=\"center\">\n<strong>\u0414\u043e\u0431\u0430\u0432\u043b\u0435\u043d\u0438\u0435 \u0438\u0437\u043e\u0431\u0440\u0430\u0436\u0435\u043d\u0438\u0439 \u0438 \u0432\u043e\u0434\u044f\u043d\u044b\u0445 \u0437\u043d\u0430\u043a\u043e\u0432<\/strong><br \/>\n<\/h1>\n<p align=\"justify\">\n9. \u0414\u043e\u0431\u0430\u0432\u043b\u0435\u043d\u0438\u044e \u0432\u043e\u0434\u044f\u043d\u043e\u0433\u043e \u0437\u043d\u0430\u043a\u0430 \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyPDF2<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\n# \u0414\u043e\u0431\u0430\u0432\u043b\u0435\u043d\u0438\u0435 \u0432\u043e\u0434\u044f\u043d\u043e\u0433\u043e \u0437\u043d\u0430\u043a\u0430 \u0432 \u043e\u0434\u043d\u043e\u0441\u0442\u0440\u0430\u043d\u0438\u0447\u043d\u044b\u0439 PDF\r\n\r\nimport PyPDF2\r\n\r\ninput_file = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\noutput_file = &quot;dist\/Computer-Vision-Resources-page-drafted.pdf&quot;\r\nwatermark_file = &quot;source\/mshe-logo-512x512.pdf&quot;\r\n\r\nwith open(input_file, &quot;rb&quot;) as filehandle_input:\r\n    # \u0447\u0438\u0442\u0430\u0442\u044c \u0441\u043e\u0434\u0435\u0440\u0436\u0438\u043c\u043e\u0435 \u0438\u0441\u0445\u043e\u0434\u043d\u043e\u0433\u043e \u0444\u0430\u0439\u043b\u0430\r\n    pdf = PyPDF2.PdfFileReader(filehandle_input)\r\n    \r\n    with open(watermark_file, &quot;rb&quot;) as filehandle_watermark:\r\n        # \u0447\u0438\u0442\u0430\u0442\u044c \u0441\u043e\u0434\u0435\u0440\u0436\u0430\u043d\u0438\u0435 \u0432\u043e\u0434\u044f\u043d\u043e\u0433\u043e \u0437\u043d\u0430\u043a\u0430\r\n        watermark = PyPDF2.PdfFileReader(filehandle_watermark)\r\n        \r\n        # \u043f\u043e\u043b\u0443\u0447\u0438\u0442\u044c \u043f\u0435\u0440\u0432\u0443\u044e \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u0443 \u043e\u0440\u0438\u0433\u0438\u043d\u0430\u043b\u044c\u043d\u043e\u0433\u043e PDF\r\n        first_page = pdf.getPage(0)\r\n        \r\n        # \u043f\u043e\u043b\u0443\u0447\u0438\u0442\u044c \u043f\u0435\u0440\u0432\u0443\u044e \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u0443 \u0432\u043e\u0434\u044f\u043d\u043e\u0433\u043e \u0437\u043d\u0430\u043a\u0430 PDF\r\n        first_page_watermark = watermark.getPage(0)\r\n        \r\n        # \u043e\u0431\u044a\u0435\u0434\u0438\u043d\u0438\u0442\u044c \u0434\u0432\u0435 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u044b\r\n        first_page.mergePage(first_page_watermark)\r\n        \r\n        # \u0441\u043e\u0437\u0434\u0430\u0442\u044c \u043e\u0431\u044a\u0435\u043a\u0442 \u0437\u0430\u043f\u0438\u0441\u0438 PDF \u0434\u043b\u044f \u0432\u044b\u0445\u043e\u0434\u043d\u043e\u0433\u043e \u0444\u0430\u0439\u043b\u0430\r\n        pdf_writer = PyPDF2.PdfFileWriter()\r\n        \r\n        # \u0434\u043e\u0431\u0430\u0432\u0438\u0442\u044c \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u0443\r\n        pdf_writer.addPage(first_page)\r\n        \r\n        with open(output_file, &quot;wb&quot;) as filehandle_output:\r\n            # \u0437\u0430\u043f\u0438\u0441\u0430\u0442\u044c \u0444\u0430\u0439\u043b \u0441 \u0432\u043e\u0434\u044f\u043d\u044b\u043c\u0438 \u0437\u043d\u0430\u043a\u0430\u043c\u0438 \u0432 \u043d\u043e\u0432\u044b\u0439 \u0444\u0430\u0439\u043b\r\n            pdf_writer.write(filehandle_output)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/9.png\" alt=\"\" width=\"929\" height=\"1013\" class=\"aligncenter size-full wp-image-607\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/9.png 929w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/9-275x300.png 275w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/9-768x837.png 768w\" sizes=\"(max-width: 929px) 100vw, 929px\" \/>\n<\/p>\n<p align=\"justify\">\n10. \u0414\u043e\u0431\u0430\u0432\u043b\u0435\u043d\u0438\u0435 \u0438\u0437\u043e\u0431\u0440\u0430\u0436\u0435\u043d\u0438\u044f \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyMuPDF<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\nimport fitz\r\n\r\ninput_file = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\noutput_file = &quot;dist\/Computer-Vision-Resources-page-image.pdf&quot;\r\nbarcode_file = &quot;source\/waksoft-QR-code.jpg&quot;\r\n\r\n# \u043e\u043f\u0440\u0435\u0434\u0435\u043b\u0438\u0442\u044c \u043f\u043e\u0437\u0438\u0446\u0438\u044e (\u0432\u0435\u0440\u0445\u043d\u0438\u0439 \u043f\u0440\u0430\u0432\u044b\u0439 \u0443\u0433\u043e\u043b)\r\nimage_rectangle = fitz.Rect(450, 170, 550, 270)\r\n\r\n# retrieve the first page of the PDF\r\nfile_handle = fitz.open(input_file)\r\nfirst_page = file_handle[0]\r\n\r\n# \u0434\u043e\u0431\u0430\u0432\u0438\u0442\u044c \u0438\u0437\u043e\u0431\u0440\u0430\u0436\u0435\u043d\u0438\u0435\r\nfirst_page.insertImage(image_rectangle, filename=barcode_file)\r\n\r\nfile_handle.save(output_file)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/10.png\" alt=\"\" width=\"929\" height=\"879\" class=\"aligncenter size-full wp-image-609\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/10.png 929w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/10-300x284.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/10-768x727.png 768w\" sizes=\"(max-width: 929px) 100vw, 929px\" \/>\n<\/p>\n<p align=\"justify\">\n11. \u0414\u043e\u0431\u0430\u0432\u043b\u0435\u043d\u0438\u0435 \u0448\u0442\u0430\u043c\u043f\u043e\u0432 \u0441 pdfrw<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\n# \u0414\u043e\u0431\u0430\u0432\u043b\u0435\u043d\u0438\u0435 QR-\u043a\u043e\u0434\u0430 \u0432 \u043c\u043d\u043e\u0433\u043e\u0441\u0442\u0440\u0430\u043d\u0438\u0447\u043d\u044b\u0439 PDF \u0434\u043e\u043a\u0443\u043c\u0435\u043d\u0442\r\n\r\nfrom pdfrw import PdfReader, PdfWriter, PageMerge\r\n\r\ninput_file = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\noutput_file = &quot;dist\/Computer-Vision-Resources-QR-pages.pdf&quot;\r\nwatermark_file = &quot;source\/waksoft-QR-code.pdf&quot;\r\n\r\n# \u043e\u043f\u0440\u0435\u0434\u0435\u043b\u044f\u0435\u043c \u043e\u0431\u044a\u0435\u043a\u0442\u044b \u0447\u0442\u0435\u043d\u0438\u044f \u0438 \u0437\u0430\u043f\u0438\u0441\u0438\r\nreader_input = PdfReader(input_file)\r\nwriter_output = PdfWriter()\r\nwatermark_input = PdfReader(watermark_file)\r\nwatermark = watermark_input.pages[0]\r\n\r\n# \u043f\u0440\u043e\u0441\u043c\u0430\u0442\u0440\u0438\u0432\u0430\u0442\u044c \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u044b \u043e\u0434\u043d\u0443 \u0437\u0430 \u0434\u0440\u0443\u0433\u043e\u0439\r\nfor current_page in range(len(reader_input.pages)):\r\n    merger = PageMerge(reader_input.pages[current_page])\r\n    merger.add(watermark).render()\r\n\r\n# \u0437\u0430\u043f\u0438\u0441\u0430\u0442\u044c \u0438\u0437\u043c\u0435\u043d\u0435\u043d\u043d\u044b\u0439 \u043a\u043e\u043d\u0442\u0435\u043d\u0442 \u043d\u0430 \u0434\u0438\u0441\u043a\r\nwriter_output.write(output_file, reader_input)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/11.png\" alt=\"\" width=\"927\" height=\"882\" class=\"aligncenter size-full wp-image-611\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/11.png 927w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/11-300x285.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/11-768x731.png 768w\" sizes=\"(max-width: 927px) 100vw, 927px\" \/>\n<\/p>\n<h1 align=\"center\">\n<strong>\u0412\u0441\u0442\u0430\u0432\u043a\u0430, \u0443\u0434\u0430\u043b\u0435\u043d\u0438\u0435 \u0438 \u0438\u0437\u043c\u0435\u043d\u0435\u043d\u0438\u0435 \u043f\u043e\u0440\u044f\u0434\u043a\u0430 \u0441\u0442\u0440\u0430\u043d\u0438\u0446<\/strong><br \/>\n<\/h1>\n<p align=\"justify\">\n12. \u0423\u0434\u0430\u043b\u0435\u043d\u0438\u0435 \u0441\u0442\u0440\u0430\u043d\u0438\u0446 \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e pdfrw<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\n# \u0423\u0434\u0430\u043b\u0438\u0442\u0435 \u043f\u0435\u0440\u0432\u044b\u0435 \u0434\u0432\u0435 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u044b (\u0442\u0438\u0442\u0443\u043b\u044c\u043d\u044b\u0439 \u043b\u0438\u0441\u0442) \u0438\u0437 PDF\r\n\r\nfrom pdfrw import PdfReader, PdfWriter\r\n\r\ninput_file = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\noutput_file = &quot;dist\/example-updated.pdf&quot;\r\n\r\n# \u041e\u043f\u0440\u0435\u0434\u0435\u043b\u0438\u0442\u044c \u043e\u0431\u044a\u0435\u043a\u0442\u044b \u0447\u0442\u0435\u043d\u0438\u044f \u0438 \u0437\u0430\u043f\u0438\u0441\u0438\r\nreader_input = PdfReader(input_file)\r\nwriter_output = PdfWriter()\r\n\r\n# \u041f\u0435\u0440\u0435\u0439\u0442\u0438 \u043d\u0430 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u0443 \u043e\u0434\u0438\u043d \u0437\u0430 \u0434\u0440\u0443\u0433\u0438\u043c\r\nfor current_page in range(len(reader_input.pages)):\r\n    if current_page &amp;gt; 1:\r\n        writer_output.addpage(reader_input.pages[current_page])\r\n        print(&quot;adding page %i&quot; % (current_page + 1))\r\n\r\n# \u0417\u0430\u043f\u0438\u0441\u0430\u0442\u044c \u0438\u0437\u043c\u0435\u043d\u0435\u043d\u043d\u044b\u0439 \u043a\u043e\u043d\u0442\u0435\u043d\u0442 \u043d\u0430 \u0434\u0438\u0441\u043a\r\nwriter_output.write(output_file)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/12.png\" alt=\"\" width=\"670\" height=\"225\" class=\"aligncenter size-full wp-image-614\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/12.png 670w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/12-300x101.png 300w\" sizes=\"(max-width: 670px) 100vw, 670px\" \/><\/p>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/13.png\" alt=\"\" width=\"695\" height=\"689\" class=\"aligncenter size-large wp-image-615\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/13.png 931w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/13-300x297.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/13-150x150.png 150w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/13-768x761.png 768w\" sizes=\"(max-width: 695px) 100vw, 695px\" \/>\n<\/p>\n<p align=\"justify\">\n13. \u0423\u0434\u0430\u043b\u0435\u043d\u0438\u0435 \u0441\u0442\u0440\u0430\u043d\u0438\u0446 \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyMuPDF<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\n# \u041d\u0430\u043f\u043e\u043c\u043d\u0438\u043c, \u0447\u0442\u043e PyMuPDF \u0438\u043c\u043f\u043e\u0440\u0442\u0438\u0440\u0443\u0435\u0442\u0441\u044f \u043a\u0430\u043a fitz\r\nimport fitz\r\n\r\ninput_file = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\noutput_file = &quot;dist\/Computer-Vision-Resources-rearranged.pdf&quot;\r\n\r\n# \u041e\u043f\u0440\u0435\u0434\u0435\u043b\u0438\u0442\u0435 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u044b \u0434\u043b\u044f \u0441\u043e\u0445\u0440\u0430\u043d\u0435\u043d\u0438\u044f - 1, 2 \u0438 4\r\nfile_handle = fitz.open(input_file)\r\npages_list = [0,1,3]\r\n\r\n# \u0412\u044b\u0431\u0435\u0440\u0438\u0442\u0435 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u044b \u0438 \u0441\u043e\u0445\u0440\u0430\u043d\u0438\u0442\u0435 \u0432\u044b\u0432\u043e\u0434\r\nfile_handle.select(pages_list)\r\nfile_handle.save(output_file)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/14.png\" alt=\"\" width=\"930\" height=\"816\" class=\"aligncenter size-full wp-image-617\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/14.png 930w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/14-300x263.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/14-768x674.png 768w\" sizes=\"(max-width: 930px) 100vw, 930px\" \/>\n<\/p>\n<p align=\"justify\">\n14. \u0412\u0441\u0442\u0430\u0432\u043a\u0430 \u0441\u0442\u0440\u0430\u043d\u0438\u0446 \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyMuPDF<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\n# \u041d\u0430\u043f\u043e\u043c\u043d\u0438\u043c, \u0447\u0442\u043e PyMuPDF \u0438\u043c\u043f\u043e\u0440\u0442\u0438\u0440\u0443\u0435\u0442\u0441\u044f \u043a\u0430\u043a fitz\r\nimport fitz\r\n\r\noriginal_pdf_path = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\nextra_page_path = &quot;dist\/extra-page.pdf&quot;\r\noutput_file_path = &quot;dist\/example-extended.pdf&quot;\r\n\r\noriginal_pdf = fitz.open(original_pdf_path)\r\nextra_page = fitz.open(extra_page_path)\r\n\r\noriginal_pdf.insertPDF(extra_page)\r\noriginal_pdf.save(output_file_path)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/15.png\" alt=\"\" width=\"928\" height=\"851\" class=\"aligncenter size-full wp-image-619\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/15.png 928w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/15-300x275.png 300w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/15-768x704.png 768w\" sizes=\"(max-width: 928px) 100vw, 928px\" \/>\n<\/p>\n<p align=\"justify\">\n15. \u0420\u0430\u0437\u0434\u0435\u043b\u0435\u043d\u0438\u0435 \u0447\u0435\u0442\u043d\u044b\u0445 \u0438 \u043d\u0435\u0447\u0435\u0442\u043d\u044b\u0445 \u0441\u0442\u0440\u0430\u043d\u0438\u0446 \u0441 \u043f\u043e\u043c\u043e\u0449\u044c\u044e PyPDF2<\/p>\n<pre class=\"wp-code-highlight prettyprint\">\r\nfrom PyPDF2 import PdfFileReader, PdfFileWriter\r\n\r\npdf_document = &quot;source\/Computer-Vision-Resources.pdf&quot;\r\npdf = PdfFileReader(pdf_document)\r\n\r\n# \u0412\u044b\u0445\u043e\u0434\u043d\u044b\u0435 \u0444\u0430\u0439\u043b\u044b \u0434\u043b\u044f \u043d\u043e\u0432\u044b\u0445 PDF-\u0444\u0430\u0439\u043b\u043e\u0432\r\noutput_filename_even = &quot;dist\/Computer-Vision-even.pdf&quot;\r\noutput_filename_odd = &quot;dist\/Computer-Vision-odd.pdf&quot;\r\n\r\npdf_writer_even = PdfFileWriter()\r\npdf_writer_odd = PdfFileWriter()\r\n\r\n# \u041f\u043e\u043b\u0443\u0447\u0438\u0442\u044c \u0434\u043e\u0441\u044f\u0433\u0430\u0435\u043c\u0443\u044e \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u0443 \u0438 \u0434\u043e\u0431\u0430\u0432\u0438\u0442\u044c \u0435\u0435 \u0432 \u0441\u043e\u043e\u0442\u0432\u0435\u0442\u0441\u0442\u0432\u0443\u044e\u0449\u0443\u044e\r\n# \u0432\u044b\u0445\u043e\u0434\u043d\u043e\u0439 \u0444\u0430\u0439\u043b \u043d\u0430 \u043e\u0441\u043d\u043e\u0432\u0435 \u043d\u043e\u043c\u0435\u0440\u0430 \u0441\u0442\u0440\u0430\u043d\u0438\u0446\u044b\r\nfor page in range(pdf.getNumPages()):\r\n    current_page = pdf.getPage(page)\r\n    if page % 2 == 0:\r\n        pdf_writer_odd.addPage(current_page)\r\n    else:\r\n        pdf_writer_even.addPage(current_page)\r\n\r\n# \u0417\u0430\u043f\u0438\u0441\u0430\u0442\u044c \u0434\u0430\u043d\u043d\u044b\u0435 \u043d\u0430 \u0434\u0438\u0441\u043a\r\nwith open(output_filename_even, &quot;wb&quot;) as out:\r\n     pdf_writer_even.write(out)\r\n     print(&quot;created&quot;, output_filename_even)\r\n\r\n# \u0417\u0430\u043f\u0438\u0441\u0430\u0442\u044c \u0434\u0430\u043d\u043d\u044b\u0435 \u043d\u0430 \u0434\u0438\u0441\u043a\r\nwith open(output_filename_odd, &quot;wb&quot;) as out:\r\n     pdf_writer_odd.write(out)\r\n     print(&quot;created&quot;, output_filename_odd)\r\n<\/pre>\n<p><img loading=\"lazy\" src=\"http:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/16.png\" alt=\"\" width=\"671\" height=\"180\" class=\"aligncenter size-full wp-image-620\" srcset=\"https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/16.png 671w, https:\/\/is19-2017.susu.ru\/matveev\/wp-content\/uploads\/sites\/10\/2021\/03\/16-300x80.png 300w\" sizes=\"(max-width: 671px) 100vw, 671px\" \/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u0421\u0441\u044b\u043b\u043a\u0430 \u043d\u0430 GitHub \u0441 \u0438\u0441\u0445\u043e\u0434\u043d\u044b\u043c\u0438 \u0434\u0430\u043d\u043d\u044b\u043c\u0438 \u0438 \u0433\u043e\u0442\u043e\u0432\u044b\u043c \u043f\u0440\u043e\u0435\u043a\u0442\u043e\u043c 1. \u041d\u0435\u043e\u0431\u0445\u043e\u0434\u0438\u043c\u043e \u0443\u0441\u0442\u0430\u043d\u043e\u0432\u0438\u0442\u044c python 2. \u0414\u043b\u044f \u0440\u0430\u0431\u043e\u0442\u044b \u0441 pdf-\u0444\u0430\u0439\u043b\u0430\u043c\u0438 \u043d\u0430\u043c \u043f\u043e\u043d\u0430\u0434\u043e\u0431\u0438\u0442\u0441\u044f \u0443\u0441\u0442\u0430\u043d\u043e\u0432\u0438\u0442\u044c \u0441\u043b\u0435\u0434\u0443\u044e\u0449\u0438\u0435 \u043f\u0430\u043a\u0435\u0442\u044b \u0447\u0435\u0440\u0435\u0437 \u043a\u043e\u043c\u0430\u043d\u0434\u043d\u0443\u044e&hellip;<\/p>\n","protected":false},"author":13,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[15],"tags":[],"_links":{"self":[{"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/posts\/581"}],"collection":[{"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/comments?post=581"}],"version-history":[{"count":25,"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/posts\/581\/revisions"}],"predecessor-version":[{"id":622,"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/posts\/581\/revisions\/622"}],"wp:attachment":[{"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/media?parent=581"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/categories?post=581"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/is19-2017.susu.ru\/matveev\/wp-json\/wp\/v2\/tags?post=581"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}