Enable vertical text detection for rotated images #4328
Merged
vladimir-kivi-ds merged 16 commits into main on Apr 14, 2026
badGarnet (Collaborator) reviewed on Apr 9, 2026 and left a comment:
It would be good to have a behavior-change test: e.g., partition a rotated page with hi-res and check that the detected text grouping is now correct.
Author (Contributor): Something like this?
…ures update (#4331)

This pull request includes updated ingest test fixtures. Please review and merge if appropriate.

> [!NOTE]
> **Low Risk**
> Updates are limited to test fixture golden files (HTML/JSON) with small text/id changes, so production behavior is unaffected; risk is mainly around masking or legitimizing unintended extraction regressions.
>
> **Overview**
> Updates ingest golden fixtures for `layout-parser-paper.pdf` structured output in both HTML and JSON.
>
> The expected extracted content changes slightly (author line character corrections and an added trailing page number in a `ListItem`) and corresponding `element_id`s are updated to match the new extraction output.
>
> Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit 191ba7e.

Co-authored-by: vladimir-kivi-ds <vladimir-kivi-ds@users.noreply.github.com>
… github.com:Unstructured-IO/unstructured into vk/enable-vertical-text-detection-for-rotated-pages
`adlfs` 2026.4.0 introduced a breaking auth change: the default `anon` behavior flipped to `False`, so code that previously read public Azure blobs anonymously now tries `DefaultAzureCredential` unless `anon=True` is set explicitly. That matches the CI failure here. This commit pins `adlfs==2026.2.0` to temporarily unblock the changes that fail when bumping to the latest version.

> [!NOTE]
> **Medium Risk**
> Pins `adlfs` to an older version to work around an Azure public-container ingest regression, which can affect Azure connector behavior at runtime. The rest is CI-only fixture update plumbing and is low risk.
>
> **Overview**
> Pins `adlfs` to `==2026.2.0` via `pyproject.toml` UV constraints (and updates `uv.lock`) to avoid a regression affecting anonymous access to public Azure blob containers.
>
> Updates the `ingest-test-fixtures-update-pr` GitHub Actions workflow to also generate and include `expected-structured-output-markdown` fixtures in the auto-created PRs, and records this change in the `CHANGELOG.md`.
>
> Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit 88a7801.

Co-authored-by: Vladimir Kirilenko <vladimir.kirilenko@deepsense.ai>
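As a rough illustration of the longer-term alternative to pinning, the sketch below always sets `anon` explicitly so the result does not depend on the library default. `azure_storage_options` is a hypothetical helper, not part of `adlfs` or `unstructured`, and the rule that missing credentials imply a public container is an assumption made for this example.

```python
from typing import Any, Optional


def azure_storage_options(
    account_name: str,
    *,
    anon: Optional[bool] = None,
    **credentials: Any,
) -> dict:
    """Build filesystem keyword arguments with `anon` always set explicitly.

    Hypothetical helper for illustration only.
    """
    if anon is None:
        # Assumption: no credentials supplied means a public container,
        # so request anonymous access instead of relying on the default.
        anon = not credentials
    if anon and credentials:
        # Design choice of this sketch: reject ambiguous combinations.
        raise ValueError("anon=True cannot be combined with explicit credentials")
    return {"account_name": account_name, "anon": anon, **credentials}


# Possible usage (requires adlfs to be installed; not executed here):
#   import fsspec
#   fs = fsspec.filesystem("az", **azure_storage_options("somepublicaccount"))
```

Passing `anon` on every call keeps anonymous reads of public containers working regardless of which default a given `adlfs` release uses.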
badGarnet approved these changes on Apr 10, 2026
No description provided.