@github-actions
Contributor

@github-actions github-actions bot commented Jan 2, 2026

This is an automated pull request to merge tofik/pdf-text-cleaning into dev.
It was created by the [Auto Pull Request] action.

@vercel

vercel bot commented Jan 2, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Review | Updated (UTC) |
| --- | --- | --- | --- |
| app | Ready | Preview, Comment | Jan 2, 2026 3:30am |

1 Skipped Deployment

| Project | Deployment | Review | Updated (UTC) |
| --- | --- | --- | --- |
| portal | Skipped | | Jan 2, 2026 3:30am |

@cursor

cursor bot commented Jan 2, 2026

PR Summary

Improves the reliability of PDF generation by refining text sanitization in cleanTextForPDF for both the API and app PDF generators.

  • Strip invisible/control Unicode (e.g., soft hyphen, zero‑width, bidi marks, BOM, replacement char) before other replacements
  • Apply replacements to the stripped text to normalize quotes/dashes/ellipses/arrows, etc.
  • Change fallback behavior to preserve unknown characters (instead of replacing with ?), while keeping ASCII fallbacks for common accents
  • Ensure list bullets and headings use the cleaned text consistently
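
The ordering described in the summary (strip first, then replace, then preserve anything unmapped) can be illustrated with a minimal sketch. This is not the PR's actual code: the character ranges, the `INVISIBLE_CHARS` and `REPLACEMENTS` names, and the table contents are assumptions chosen for illustration, and the real cleanTextForPDF may cover more characters or be structured differently.

```typescript
// Hypothetical sketch of the cleaning order described above; not the PR's actual code.

// Invisible/control code points (assumed set): soft hyphen, zero-width
// characters and bidi marks, bidi embedding/isolate controls, word joiner,
// BOM, and the replacement character.
const INVISIBLE_CHARS =
  /[\u00AD\u200B-\u200F\u202A-\u202E\u2060\u2066-\u2069\uFEFF\uFFFD]/g;

// Illustrative subset of typographic replacements and ASCII accent fallbacks.
const REPLACEMENTS: Record<string, string> = {
  "\u2018": "'",   // left single quote
  "\u2019": "'",   // right single quote
  "\u201C": '"',   // left double quote
  "\u201D": '"',   // right double quote
  "\u2013": "-",   // en dash
  "\u2014": "-",   // em dash
  "\u2026": "...", // ellipsis
  "\u2192": "->",  // right arrow
  "\u00E9": "e",   // e with acute accent
  "\u00F1": "n",   // n with tilde
};

function cleanTextForPDF(text: string): string {
  // Step 1: strip invisible characters before any other replacement.
  const stripped = text.replace(INVISIBLE_CHARS, "");
  // Step 2: apply replacements to the stripped text. Characters with no
  // mapping are preserved as-is rather than replaced with "?".
  return stripped.replace(/[^\x00-\x7F]/g, (ch) => {
    const mapped = REPLACEMENTS[ch];
    return mapped !== undefined ? mapped : ch;
  });
}
```

Under these assumptions, `cleanTextForPDF("caf\u00E9 \u2192 bar")` returns `"cafe -> bar"`, while a character outside the table, such as a CJK ideograph, passes through unchanged.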

Written by Cursor Bugbot for commit 57c1b7d. This will update automatically on new commits.

@graphite-app graphite-app bot requested a review from Marfuen January 2, 2026 03:29
@graphite-app

graphite-app bot commented Jan 2, 2026

Graphite Automations

"Auto-assign PRs to Author" took an action on this PR • (01/02/26)

1 reviewer was added to this PR based on Mariano Fuentes's automation.

@tofikwest tofikwest merged commit 67a37d5 into main Jan 2, 2026
11 of 13 checks passed
@tofikwest tofikwest deleted the tofik/pdf-text-cleaning branch January 2, 2026 03:56
claudfuen pushed a commit that referenced this pull request Jan 2, 2026
## [1.72.1](v1.72.0...v1.72.1) (2026-01-02)

### Bug Fixes

* **api:** enhance error handling for encrypted PDFs in NDA processing ([#1966](#1966)) ([d786d79](d786d79))
* **api:** handle encrypted PDFs in NDA PDF processing ([#1965](#1965)) ([2808706](2808706))
* **api:** improve text cleaning for PDF rendering by stripping invisible unicode characters ([#1962](#1962)) ([67a37d5](67a37d5))
@claudfuen
Contributor

🎉 This PR is included in version 1.72.1 🎉

The release is available as a GitHub release.

Your semantic-release bot 📦🚀
