-
-
Notifications
You must be signed in to change notification settings - Fork 311
Description
Data Quality Issue Details
Issue Type:
- Incorrect financial values (wrong numbers)
- [x ] Missing financial data (expected data not present)
- Calculation errors (formulas producing wrong results)
- Data inconsistency (different values for same metric)
- [ x] Historical data problems (changes over time)
Environment
EdgarTools Version: 15.4.0
Python Version:3.10.19
Operating System:win 11
Financial Data Details
Company/Ticker: BRO, FOX, IMAX, DIS
Form Type: 10-K
Filing Date/Period:2025 - 2019
Statement Type: all three major statements
Specific Metric/Concept:
-
for BRO: us-gaap_StockholdersEquity, us-gaap_StockholdersEquityIncludingPortionAttributableToNoncontrollingInterest
-
for FOX: us-gaap_LongTermDebtCurrent
-
for IMAX: us-gaap_WeightedAverageNumberOfSharesOutstandingBasic, us-gaap_WeightedAverageNumberOfDilutedSharesOutstanding
-
for DIS: us-gaap_CostOfGoodsAndServicesSold, us-gaap_NetCashProvidedByUsedInOperatingActivitiesContinuingOperations, us-gaap_NetCashProvidedByUsedInFinancingActivitiesContinuingOperations, us-gaap_IncomeLossFromContinuingOperationsIncludingPortionAttributableToNoncontrollingInterest,
-
XBRL concept name: (if known, e.g., "us-gaap:Revenues")
Data Issue
Expected Value:
- Issues with missing data, switching Current and Non Current debt, Missing information pre-2022, and overwritten data.
Actual Value from EdgarTools:
Code to reproduce:
from edgar import Company
company_ticker = input("Enter company ticker (e.g., MSFT): ").upper()
write_files = input("Do you want to save CSV files of the sheets? Y/N ").upper()
number_years = int(input("Enter the number(years) of annual reports desired = "))
data['company_ticker'] = company_ticker
company = Company(company_ticker)
filings = company.get_filings(form="10-K").head(number_years)
xbrls = XBRLS.from_filings(filings)
stitched_statements = xbrls.statements
income_statement = stitched_statements.income_statement()
balance_sheet = stitched_statements.balance_sheet()
cash_flow = stitched_statements.cashflow_statement()
bs_df = balance_sheet.to_dataframe()
is_df = income_statement.to_dataframe()
cf_df = cash_flow.to_dataframe()Cross-Verification
Have you verified this issue with:
- [ x] Multiple time periods for same company
- Multiple companies with same issue
- Direct SEC filing comparison
- [ x] Other financial data sources
Affects multiple periods/companies?
- Companies tested: (e.g., AAPL, MSFT, GOOGL)
- Time periods tested: (e.g., 2021-2023)
- Pattern observed: (e.g., all Q4 periods affected, only certain companies)
Expected Behavior
What should happen:
A clear description of the correct financial data that should be returned.
The Disney data is significantly compromised. The other stocks are more limited impact. I am including an Excel file with a list of more details, but I can provide more specific info if necessary. However, these should be very easy to find.
Data validation rules:
- Should the value be positive/negative?
- Expected magnitude/range?
- Should it match specific calculations?
Additional Context
- Links to official SEC filings showing correct values
- Screenshots comparing EdgarTools output vs official data
- Any patterns or hypotheses about why the data might be wrong
- Related issues or similar problems you've noticed
Impact Assessment:
- Minor (affects specific edge case)
- Moderate (affects common use cases)
- [x ] Major (affects core financial calculations)
- [x ] Critical (produces completely wrong results)
Data quality issues are high priority and will be verified against official SEC filings. Accuracy is fundamental to EdgarTools.