These code samples implement the dataset and workflow result export features of Texera (https://github.com/apache/texera), an open-source Apache incubating platform for collaborative data analytics.
The Texera platform supports creation of workflows for data analytics and AI/ML. Datasets and the generated tables of data from workflow executions can be downloaded to the local filesystem through this export system. This export system enables browser native download, ensuring that download progress will be shown in the browser UI.
The original pull requests outlining my contributions are the following:
- Dataset Export: apache/texera#3621
- Workflow Result Export: apache/texera#3728
The flow of frontend to backend logic are as follows:
- Dataset Export:
download.service.ts→dataset.service.ts→DatasetResource.scala→S3StorageClient.scala - Workflow Result Export:
download.service.ts→WorkflowExecutionsResource.scala→ResultExportService.scala→ [remainder not included]
This work is derived from the open-source Texera project, licensed under the Apache License 2.0. Original license headers are retained in all source files, and modifications are clearly indicated.