A powerful yet easy-to-use epub parser
The package exports a simple parser function which use epub file as input and output JavaScript object.
As it is written in TypeScript, types are already included in the package.
Inspired by @gxl/epub-parser but without dependency to fs, which allows it to use this lib with react-native.
npm install davidka/epub-parser --saveor if you prefer yarn
yarn add davidka/epub-parserimport parser from 'davidka/epub-parser'
// if you use `require` don't forget to add `.default`
// const parser = require('simple-epub-parser').default
console.log('epub content:', parser(binaryData))type: buffer
The output is an object which contains structure, sections, info along with some other properties that deals with the epub file. They start with _. I don't recommend using these properties, since they are subscribed to change. They are where they are simply because JavaScript don't have native private member variable support, and sometimes they are helpful for debugging.
structure is the parsed toc of epub file, they contain information about how the book is constructed.
sections is an array of chapters or sections under chapters, they are referred in structure. Each section object contains the raw html string and a few handy methods to help you with you needs. toMarkdown convert the current section to markdown object. toHtmlObjects converts to html object. And a note about src and href, the src and href in raw html stay untouched, but the toHtmlObjects method resolves src to base64 string, and alters href so that they make sense in the parsed epub. And the parsed href is something like #{sectionId},{hash}.
It provides some util functions as well.
They can be used via
import { parseLink, parseHTML, parseNestedObject, flattenArray } from 'davidka/epub-parser'- parseLink
- parseHTML
- parseNestedObject
- Raise an issue in the issue section.
- PRs are the best.
❤️