Add Spark script example for WXD-Confluent TableFlow integration by shibil-rahman · Pull Request #41 · IBM/watsonx-data

shibil-rahman · 2026-03-11T08:06:49Z

📋 Summary

This PR adds a new tutorial demonstrating how to integrate IBM watsonx.data with Confluent Tableflow to read data from Confluent-managed Iceberg tables using WXD Spark.

📁 Changes

New Directory: Tutorials/WXD - Confluent Integration/
Files Added:
- read_confluent_table_standalone.py - Complete PySpark script for Confluent Tableflow integration
- README.md - Comprehensive documentation with usage instructions

✨ Features

The tutorial provides:

Confluent Tableflow Integration: Connect to Confluent's REST catalog using API credentials
Auto-Discovery: Automatically discovers available namespaces and tables in the catalog
Table Inspection: Describes table schemas and displays metadata
Data Querying: Retrieves and displays sample data from Confluent Tableflow tables
Standalone Execution: Runs independently with embedded Spark configuration

📖 Documentation Highlights

The README includes:

Clear overview of what the integration does
Storage authentication options:
- ✅ Confluent Managed Storage (no additional config needed)
- ✅ Integrated AWS S3 Storage (with required S3 credentials)
Three execution methods:
1. 🔬 Using SparkLab (VS Code Development Environment)
2. 🚀 Submit via Spark Application REST API
3. 💻 Submit via CPDCTL CLI
Configuration parameters reference
Troubleshooting guide
Links to relevant IBM watsonx.data documentation

🎯 Use Cases

This integration enables users to:

Query Confluent Tableflow data directly from watsonx.data
Leverage Spark's processing capabilities on Confluent-managed data
Build data pipelines that span both platforms
Perform analytics on streaming data stored in Confluent

✅ Testing

Script tested with Confluent Tableflow REST catalog
Verified auto-discovery of namespaces and tables
Confirmed data retrieval and display functionality

shibil-rahman · 2026-03-11T17:40:03Z

Hi @liuljun, This Spark example use-case is required for documenting in WXD official pages, looking forward to make this script available in public repo. Need your help on this. Thanks.

shibil-rahman-p1 added 2 commits March 11, 2026 13:07

Add scripts for WXD Confluent Integration

460e248

Update README with additional changes

dba338a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Spark script example for WXD-Confluent TableFlow integration#41

Add Spark script example for WXD-Confluent TableFlow integration#41
shibil-rahman wants to merge 2 commits intoIBM:mainfrom
shibil-rahman:WXD_confluent

shibil-rahman commented Mar 11, 2026

Uh oh!

shibil-rahman commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

shibil-rahman commented Mar 11, 2026

📋 Summary

📁 Changes

✨ Features

📖 Documentation Highlights

🎯 Use Cases

✅ Testing

Uh oh!

shibil-rahman commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant