Skip to content

feat: restructure unstructured for local upload#766

Open
nboyse wants to merge 2 commits intodevfrom
feat/restructure-unstructured-for-local-upload
Open

feat: restructure unstructured for local upload#766
nboyse wants to merge 2 commits intodevfrom
feat/restructure-unstructured-for-local-upload

Conversation

@nboyse
Copy link
Copy Markdown
Contributor

@nboyse nboyse commented Jan 27, 2026

Signed-off-by: DBT pre-commit check

Context

Unstructured really struggles locally with documents being processed

What

  • Removes auto strategy being inferred allt he time
  • Image heavy have hi res config specified explicitly
  • Page split properly for image heavy
  • Dont infer tables on image heavy docs
  • Less retries as they just overload cpu

Have you written unit tests?

  • Yes
  • No (add why you have not)

Are there any specific instructions on how to test this change?

  • Yes (if so provide more detail)
  • No

Relevant links

Signed-off-by: DBT pre-commit check
@nboyse nboyse requested a review from a team as a code owner January 27, 2026 16:11
@github-actions
Copy link
Copy Markdown
Contributor

⚠️ Python Vulnerabilities detected: 8 unique issues.

High:

Moderate:

Unknown (to be checked):

@github-actions
Copy link
Copy Markdown
Contributor

Your PR has commits that are missing the Signed-off-by trailer. This is likely due to the pre-commit hook not being configured on your local machine. The usual fix for this issue is to run pre-commit install --install-hooks --overwrite -t commit-msg -t pre-commit, however for more detailed help in setting up the pre-commit hooks, follow the instructions at https://github.com/uktrade/github-standards/blob/main/README.md#usage

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant