Skip to content

add FileMetadata extractor to tasks/utils#457

Open
alekseevpavel04 wants to merge 1 commit intoDashAISoftware:developfrom
alekseevpavel04:add-filemetadata-extractor-to-tasks-utils
Open

add FileMetadata extractor to tasks/utils#457
alekseevpavel04 wants to merge 1 commit intoDashAISoftware:developfrom
alekseevpavel04:add-filemetadata-extractor-to-tasks-utils

Conversation

@alekseevpavel04
Copy link

Summary

Added FileMetadata dataclass and extract_file_metadata fn alongside the existing type detection helper. Callers can now get structured info (row/column counts for tabular formats, dimensions for images, file list for archives, encoding for text) in one call instead of just a type string.


Type of Change

Check all that apply like this [x]:

  • Backend change
  • Frontend change
  • CI / Workflow change
  • Build / Packaging change
  • Bug fix
  • Documentation

Changes (by file)

Briefly list the important modified files and what was done.

  • DashAI/back/tasks/utils.py: added FileMetadata dataclass, five private per-type extractor fns (text, csv, json, image, archive), and the public extract_file_metadata fn that ties them together.

Notes (optional)

PIL is imported lazily inside the image extractor so it stays optional. CSV detection uses csv.Sniffer as a fallback for plain text inputs that turn out to be tabular. Archive listing is capped at 50 entries to avoid huge payloads.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant