-
Notifications
You must be signed in to change notification settings - Fork 5
Convert Word Doc to Other Formats
shoaibkhan-aspose edited this page Apr 16, 2014
·
1 revision
HWPFDocumentCore wordDocument = WordToHtmlUtils.loadDoc(new FileInputStream("data/document.doc"));
WordToHtmlConverter wordToHtmlConverter = new WordToHtmlConverter(
DocumentBuilderFactory.newInstance().newDocumentBuilder()
.newDocument());
wordToHtmlConverter.processDocument(wordDocument);
Document htmlDocument = wordToHtmlConverter.getDocument();
ByteArrayOutputStream out = new ByteArrayOutputStream();
DOMSource domSource = new DOMSource(htmlDocument);
StreamResult streamResult = new StreamResult(out);
TransformerFactory tf = TransformerFactory.newInstance();
Transformer serializer = tf.newTransformer();
serializer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
serializer.setOutputProperty(OutputKeys.INDENT, "yes");
serializer.setOutputProperty(OutputKeys.METHOD, "html");
serializer.transform(domSource, streamResult);
out.close();
FileOutputStream outputStream = new FileOutputStream("data/ApacheDocToHTML.html");
outputStream.write(out.toByteArray());
outputStream.close();
// Load the document from disk.
Document doc = new Document("data/document.doc");
doc.save("data/html/AsposeDocToHTML.html",SaveFormat.HTML); //Save the document in HTML format.
doc.save("data/AsposeDocToPDF.pdf",SaveFormat.PDF); //Save the document in PDF format.
doc.save("data/AsposeDocToTxt.txt",SaveFormat.TEXT); //Save the document in TXT format.
doc.save("data/AsposeDocToJPG.jpg",SaveFormat.JPEG); //Save the document in JPEG format.
Download Source Code
For further releases and updates, please follow these new repositories:
- Aspose.Words Java for Apache POI
- Aspose.Cells Java for Apache POI
- Aspose.Slides Java for Apache POI
- Aspose.Email Java for Apache POI
- For documentation, please visit Aspose Docs.
- Raise your queries and suggest more examples via Aspose Forums or via new social coding sites.