קודם הבא
rdJohn logo

19.jpg

מאת rdJohn
11 אוגוסט, 2022

Entry statistics

0 צפיות
0 שבחים

Entry content

Entry 273542 in Portfolio by rdJohn

Entry content

Entry 273541 in Portfolio by rdJohn

Entry content

Entry 273540 in Portfolio by rdJohn

Entry content

Entry 273539 in Portfolio by rdJohn

Entry content

Entry 273538 in Portfolio by rdJohn

Entry content

Entry 273537 in Portfolio by rdJohn

Entry content

Entry 273536 in Portfolio by rdJohn

Entry content

Entry 273535 in Portfolio by rdJohn

Entry content

Entry 273534 in Portfolio by rdJohn

Entry content

Entry 273532 in Portfolio by rdJohn

Entry content

Entry 273531 in Portfolio by rdJohn

Entry content

Entry 273530 in Portfolio by rdJohn

Entry content

Entry 273529 in Portfolio by rdJohn

Entry content

Entry 273528 in Portfolio by rdJohn

Entry content

Entry 273527 in Portfolio by rdJohn

Entry content

Entry 273524 in Portfolio by rdJohn

Entry content

Entry 273523 in Portfolio by rdJohn

Entry content

Entry 273521 in Portfolio by rdJohn

Entry content

Entry 273520 in Portfolio by rdJohn

Entry description

DOCX Track Changes Extractor

The client asked to develop the command-line utility to extract Track Changes records from docx documents (XML files in OOXML format). I developed the utility that allowed to extract paragraphs with any changes and comments for them. python-docx library provides the API to work with docx documents but it isn't enough for the advanced extraction. I used the xpath method to do this. A challenge part was to generate numbering for the paragraphs. It was implemented from scratch by the specification and some custom rules.
הוסף תגובה 0 תגובות
אנא היכנס למערכת כדי להגיב