WillyTheFish (posted July 30, 2010): Hey guys, any suggestions on how I could read and write MS Word files without using the COM interface? I need to keep all the formatting intact, though. Any ideas? Many thanks.
Wolphie (posted July 30, 2010): http://php.net/manual/en/book.simplexml.php
WillyTheFish (author, posted July 30, 2010): Hm, okay, thanks. To be honest, I don't get it. How do I process a .doc file with this library?
Mchl (posted July 30, 2010): .docx files (Office 2007 and newer) are in fact XML-based (a ZIP archive of XML files), so maybe that's what Wolphie had in mind. This doesn't solve the problem for the older binary .doc format.
Wolphie (posted July 30, 2010): .doc file or .docx? They are two completely different formats. MS Word 2007 and higher use .docx, which is essentially a ZIP archive containing XML files that store the document's data (the main text is in word/document.xml). Once extracted, that XML can easily be traversed with the SimpleXML library. If it's a .doc file then I can't help you, sorry.
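To make the "archive with XML inside" point concrete, here is a minimal sketch of opening a .docx and walking its XML. It is written in Python with only the standard library, purely as an illustration; it is not the SimpleXML code Wolphie had in mind, though the same two steps (unzip, then parse) apply there too. The function name `docx_paragraphs` is made up for this example, and the sketch only reads plain text, so it does not preserve formatting.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the elements in word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_paragraphs(path):
    """Return the plain text of each paragraph in a .docx file.

    Hypothetical helper for illustration: it reads text only and
    ignores all formatting information.
    """
    # A .docx is a ZIP archive; the main body lives in word/document.xml.
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    # <w:p> is a paragraph; <w:t> elements hold the text of its runs.
    for para in root.iter(f"{{{W_NS}}}p"):
        text = "".join(node.text or "" for node in para.iter(f"{{{W_NS}}}t"))
        paragraphs.append(text)
    return paragraphs
```

Calling `docx_paragraphs("report.docx")` (the file name is just a placeholder) gives one string per paragraph. Keeping formatting intact on write would presumably mean editing the XML in place and re-zipping the archive rather than extracting text like this, and none of it helps with the older binary .doc format.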