Skip to content

[DATA] issues with several companies in data stitching process (All three financial sheets ) #642

@miruddfan

Description

@miruddfan

Data Quality Issue Details

Issue Type:

  • Incorrect financial values (wrong numbers)
  • [x ] Missing financial data (expected data not present)
  • Calculation errors (formulas producing wrong results)
  • Data inconsistency (different values for same metric)
  • [ x] Historical data problems (changes over time)

Environment

EdgarTools Version: 15.4.0
Python Version:3.10.19
Operating System:win 11

Financial Data Details

Company/Ticker: BRO, FOX, IMAX, DIS
Form Type: 10-K
Filing Date/Period:2025 - 2019
Statement Type: all three major statements

Specific Metric/Concept:

  • for BRO: us-gaap_StockholdersEquity, us-gaap_StockholdersEquityIncludingPortionAttributableToNoncontrollingInterest

  • for FOX: us-gaap_LongTermDebtCurrent

  • for IMAX: us-gaap_WeightedAverageNumberOfSharesOutstandingBasic, us-gaap_WeightedAverageNumberOfDilutedSharesOutstanding

  • for DIS: us-gaap_CostOfGoodsAndServicesSold, us-gaap_NetCashProvidedByUsedInOperatingActivitiesContinuingOperations, us-gaap_NetCashProvidedByUsedInFinancingActivitiesContinuingOperations, us-gaap_IncomeLossFromContinuingOperationsIncludingPortionAttributableToNoncontrollingInterest,

  • XBRL concept name: (if known, e.g., "us-gaap:Revenues")

Data Issue

Expected Value:

  • Issues with missing data, switching Current and Non Current debt, Missing information pre-2022, and overwritten data.
    Actual Value from EdgarTools:

Code to reproduce:

from edgar import Company

company_ticker = input("Enter company ticker (e.g., MSFT): ").upper()
write_files = input("Do you want to save CSV files of the sheets? Y/N ").upper()
number_years = int(input("Enter the number(years) of annual reports desired = "))
data['company_ticker'] = company_ticker
company = Company(company_ticker)
filings = company.get_filings(form="10-K").head(number_years)
xbrls = XBRLS.from_filings(filings)
stitched_statements = xbrls.statements
income_statement = stitched_statements.income_statement()
balance_sheet = stitched_statements.balance_sheet()
cash_flow = stitched_statements.cashflow_statement()
bs_df = balance_sheet.to_dataframe()
is_df = income_statement.to_dataframe()
cf_df = cash_flow.to_dataframe()

Cross-Verification

Have you verified this issue with:

  • [ x] Multiple time periods for same company
  • Multiple companies with same issue
  • Direct SEC filing comparison
  • [ x] Other financial data sources

Affects multiple periods/companies?

  • Companies tested: (e.g., AAPL, MSFT, GOOGL)
  • Time periods tested: (e.g., 2021-2023)
  • Pattern observed: (e.g., all Q4 periods affected, only certain companies)

Expected Behavior

What should happen:
A clear description of the correct financial data that should be returned.

The Disney data is significantly compromised. The other stocks are more limited impact. I am including an Excel file with a list of more details, but I can provide more specific info if necessary. However, these should be very easy to find.

Data validation rules:

  • Should the value be positive/negative?
  • Expected magnitude/range?
  • Should it match specific calculations?

Additional Context

  • Links to official SEC filings showing correct values
  • Screenshots comparing EdgarTools output vs official data
  • Any patterns or hypotheses about why the data might be wrong
  • Related issues or similar problems you've noticed

Impact Assessment:

  • Minor (affects specific edge case)
  • Moderate (affects common use cases)
  • [x ] Major (affects core financial calculations)
  • [x ] Critical (produces completely wrong results)

Data quality issues are high priority and will be verified against official SEC filings. Accuracy is fundamental to EdgarTools.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions