Support for style::shrink-to-fit of ODF 1.3
The document [MS-OODF13] states in item a) in 2.1.1275 Part 3 Section 20.360, style:shrink-to-fit, "This attribute is supported in Word 2013 and later." But it is not supported. (Sadly I cannot attach an .odt file here.) Item c) in the same…
DOCX needs repaired, but repair doesn't work and validator says there's no issues
Hello, I have a question about a docx file. It was generated using our own doc gen engine. When you open the file it says the document needs to be repaired, however when you run the file through the OOXML validator, it says there are no issues with the…
How to differentiate between MSI,PUB file vs DOC/XLS/PPT files
What is the correct technical way to differentiate between MSI,PUB,MSG vs DOC,XLS,PPT files. Basically all these files are of ole type and have magic header as D0 CF 11 E0 A1 B1 1A E1 Would like to know what is the correct way to find the file type for…
Generate Organization Chart In OpenXML / Powerpoint / Excel using Python
Hi, I would like to generate a powerpoint organization chart. I'ved found this but it does not provide much information that I…
List of all URLs in docx file
Hi, I am trying to analyze various .docx files (OOXML) for purposes of finding maliciousness in them. Now i know that one way is to hide malicious URLs. Now, due to the file structure of these files, they contain a lot of links that look like this: …
Document Signing
Hello, I need the OID of X509 certificate types for digitally signing using certificate of MS Word documents: DOC and DOCX and PDF. I guess they should not have the same OIDs, not sure though, anyone knows? :) Best.
OOXML Documentation
Hi, As per my understanding, the .docx , .pptx, and .xlsx files belong to Microsoft's OOXML file format. Please correct me if wrong. I was looking for the technical documentation of this file format but could not find one. Could you kindly guide me to…
![](https://techprofile.blob.core.windows.net/images/505cbe45b8fa4b8e85d7f6b479d92980.png)
Finding VBA Compressed source code in ole file
Hi, With reference to [MS-OVBA]. I am parsing a .doc file (MS-97-2003) in Python and i am able to read the directory structure. The directory structure is given at the end here. I know that the entry 'Module1' contains some compressed VBA code. Although…
How to disable a directory entry in OLE file
Hi, I am trying to parse a .doc file (OLE file) (MS-97-2003) by reading it byte by byte in python. I am so far successful in extracting the directory structure of the file. Suppose i find a directory named "Macros" in the .doc file and i want…
How to identify number of tokens in TokenSequence. MS-OVBA
Hi, With reference to MS-OVBAhttps://video2.skills-academy.com/en-us/openspecs/office_file_formats/ms-ovba/575462ba-bf67-4190-9fac-c275523c75fc Section 2.4.1.1.7 Token Sequence It is mentioned that: "The number of Tokens in the final TokenSequence MUST be…
Keywords of directory names in a .doc file
Hi, I am trying to fetch the directory structure that’s inside a .doc file (OLE file) (MS 97-2003). I am able to fetch the names of streams and storages (like VBA, Macro, etc) correctly by following the directory chain specified in the FAT sector of…
![](https://techprofile.blob.core.windows.net/images/1JlamiBeBk2fQ98X64TYTw.png?8D8B27)
Fetching the FIB of a .doc file
I am trying to parse a (.doc) file (Microsoft Word 97-2003 Document) for purposes of extracting the FIB (File Information Block). In the Documentation given in this link: https://interoperability.blob.core.windows.net/files/MS-DOC/%5bMS-DOC%5d.pdf it…
How to fix error069040_01.xml in Excel
Hello, I open my excel file and the system informs that there is a problem with the content and that it would try to recover as much information as possible. Once I accept, the message is the following: <recoveryLog…
![](https://techprofile.blob.core.windows.net/images/P4hBsPCdy0qqJ__3IqkHfw.png?8DB46E)
OpenXML SDK: How do I find the formula in a CellFormula where the type is "dataTable"?
I'm writing an application in C# to read formulas from various cells in an Excel spreadsheet, and one section is giving me trouble. In Excel, it displays the formula as "{=TABLE(,B5)}" for all cells in this range. I stepped through my C#…
.xlsb file in DocumentFormat.OpenXml.Packaging throwing error of OpenXmlPackageException: 'The specified package is not valid.'
.xlsb file in DocumentFormat.OpenXml.Packaging throwing error of OpenXmlPackageException: 'The specified package is not valid.'
![](https://techprofile.blob.core.windows.net/images/xgCloIrQvUeqYfAqQE8a0A.png?8D8342)
Default values for attributes in clrSchemeMapping do not exist
Look at 2.1.395 Part 1 Section 17.15.1.20, clrSchemeMapping (Theme Color Mappings) in [MS-OI29500]. It has a remark, that default values "t1" and "t2" do not exist. I think, that there should be similar remarks for the default values…
Word document is corrupt for unknown reason
Hello, I have a Word document that was generated using our product that is corrupt for an unknown reason. Their input file is a docm file and I have a docx and a docm output file which says it is corrupt. I have run both documents through an OOXML…
I found an error in your doc
https://video2.skills-academy.com/en-us/openspecs/office_file_formats/ms-ovba/c375cbf2-b9c6-4594-b064-6dfb830f1851 The size of Data in bytes MUST be CompressedChunk.CompressedChunkHeader.CompressedChunkSize (section 2.4.1.1.4) plus 3. This should be plus 1.
What is invalid about this docx?
Hi, I generated this docx with my software. The Open XML SDK productivity tool does show some validation errors, but fixing these errors doesn't make it so this docx can be opened in Word. Usually when I have an invalid docx word gives me the option to…
![](https://techprofile.blob.core.windows.net/images/1JlamiBeBk2fQ98X64TYTw.png?8D8B27)
Updated writeProtection logic
Hi, it looks like MS have changed how the writeProtection element (one definition found https://video2.skills-academy.com/en-us/dotnet/api/documentformat.openxml.wordprocessing.writeprotection?view=openxml-2.8.1) behaves in Word documents. Prior to Word 365…