Xteink-X4-crosspoint-reader

mirror of https://github.com/daveallie/crosspoint-reader.git synced 2026-02-06 07:37:37 +03:00

Author	SHA1	Message	Date
Daniel	06ced8f2d1	perf: optimize large EPUB indexing from O(n²) to O(n log n) Three optimizations for EPUBs with many chapters (e.g. 2768 chapters): 1. OPF idref→href lookup: Build sorted hash index during manifest parsing, use binary search during spine resolution. Reduces ~4min to ~30-60s. 2. TOC href→spineIndex lookup: Build sorted hash index in beginTocPass(), use binary search in createTocEntry(). Reduces ~4min to ~30-60s. 3. ZIP central-dir cursor: Resume scanning from last position instead of restarting from beginning. Reduces ~8min to ~1-3min. All optimizations only activate for large EPUBs (≥400 spine items). Small books use unchanged code paths. Memory impact: ~33KB + ~39KB temporary during indexing, freed after. Expected total: ~17min → ~3-5min for Shadow Slave (2768 chapters). Also adds phase timing logs for performance measurement.	2026-01-20 23:35:54 -08:00
Arthur Tazhitdinov	8824c87490	feat: dict based Hyphenation (#305 ) ## Summary * Adds (optional) Hyphenation for English, French, German, Russian languages ## Additional Context * Included hyphenation dictionaries add approximately 280kb to the flash usage (German alone takes 200kb) * Trie encoded dictionaries are adopted from hypher project (https://github.com/typst/hypher) * Soft hyphens (and other explicit hyphens) take precedence over dict-based hyphenation. Overall, the hyphenation rules are quite aggressive, as I believe it makes more sense on our smaller screen. --------- Co-authored-by: Dave Allie <dave@daveallie.com>	2026-01-19 12:56:26 +00:00
Pavel Liashkov	0332e1103a	Add EPUB 3 nav.xhtml TOC support (#197 ) ## Summary * What is the goal of this PR? Add EPUB 3 support by implementing native navigation document (nav.xhtml) parsing with NCX fallback, addressing issue Fixes: #143. * What changes are included? - New `TocNavParser` for parsing EPUB 3 HTML5 navigation documents (`<nav epub:type="toc">`) - Detection of nav documents via `properties="nav"` attribute in OPF manifest - Fallback logic: try EPUB 3 nav first, fall back to NCX (EPUB 2) if unavailable - Graceful degradation: books without any TOC now load with a warning instead of failing ## Additional Context * The implementation follows the existing streaming XML parser pattern using Expat to minimize RAM usage on the ESP32-C3 * EPUB 3 books that include both nav.xhtml and toc.ncx will prefer the nav document (per EPUB 3 spec recommendation) * No breaking changes - existing EPUB 2 books continue to work as before * Tested on examples from https://idpf.github.io/epub3-samples/30/samples.html	2026-01-03 19:10:35 +11:00
Jonas Diemer	03f0ce04cc	Feature: go to text/start reference in epub guide section at first start (#156 ) This parses the guide section in the content.opf for text/start references and jumps to this on first open of the book. Currently, this behavior will be repeated in case the reader manually jumps to Chapter 0 and then re-opens the book. IMO, this is an acceptable edge case (for which I couldn't see a good fix other than to drag a "first open" boolean around). --------- Co-authored-by: Sam Davis <sam@sjd.co> Co-authored-by: Dave Allie <dave@daveallie.com>	2025-12-30 23:02:46 +11:00
Dave Allie	be1b5bad21	Parse the author name from content.opf file (#165 ) ## Summary * Parse the author name from content.opf file * Listed in the dc:creator tag within the metadata section	2025-12-30 22:15:44 +11:00
Dave Allie	fb5fc32c5d	Add exFAT support (#150 ) ## Summary * Swap to updated SDCardManager which uses SdFat * Add exFAT support * Swap to using FsFile everywhere * Use newly exposed `SdMan` macro to get to static instance of SDCardManager * Move a bunch of FsHelpers up to SDCardManager	2025-12-30 16:09:30 +11:00
Dave Allie	b6bc1f7ed3	New book.bin spine and table of contents cache (#104 ) ## Summary * Use single unified cache file for book spine, table of contents, and core metadata (title, author, cover image) * Use new temp item store file in OPF parsing to store items to be rescaned when parsing spine * This avoids us holding these items in memory * Use new toc.bin.tmp and spine.bin.tmp to build out partial toc / spine data as part of parsing content.opf and the NCX file * These files are re-read multiple times to ultimately build book.bin ## Additional Context * Spec for file format included below as an image * This should help with: * #10 * #60 * #99	2025-12-24 22:36:13 +11:00
Dave Allie	0d32d21d75	Small code cleanup (#83 ) Some checks are pending CI / build (push) Waiting to run Details ## Summary * Fix cppcheck low violations * Remove teardown method on parsers, use destructor * Code cleanup	2025-12-21 15:43:53 +11:00
Dave Allie	c262f222de	Parse cover image path from content.opf file (#24 ) Some checks are pending CI / build (push) Waiting to run Details	2025-12-16 03:15:54 +11:00
Dave Allie	c7a32fe41f	Remove tinyxml2 dependency replace with expat parsers (#9 )	2025-12-13 19:36:01 +11:00

10 Commits