MarkSentry – zero-trust document-to-Markdown for RAG pipelines
1 point, 0 comments on Hacker News
1 point, 0 comments on Hacker News
We're open-sourcing 14 components & examples today for PDF, DOCX, and XLSX viewers, plus bounding box citations, file upload, e-signature, and more. It's MIT licensed and fully customizable. Demo video here: https://share.
There is a file in almost every backend I have worked on. It generates PDFs. Invoices, mostly.
3 points, 2 comments on Hacker News
HTML is a lightweight markup language that stays close to plain text. It is made up of elements (tags) applied to the text to indicate the function of each part: is it a paragraph, a heading, or a table?