Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cleanup from #2489 #2802

Closed
lauriemerrell opened this issue Jul 14, 2023 · 11 comments · Fixed by #2804
Closed

Cleanup from #2489 #2802

lauriemerrell opened this issue Jul 14, 2023 · 11 comments · Fixed by #2804
Assignees

Comments

@lauriemerrell
Copy link
Contributor

lauriemerrell commented Jul 14, 2023

Need to fix a bunch of errors now that full refresh from #2489 has run (using this ticket to make investigation notes):

  • dbt_utils_unique_combination_of_columns_int_gtfs_rt__trip_updates_trip_day_map_grouping_key__dt__service_date__trip_schedule_relationship__trip_direction_id.bb37665ec3 - Got 4334 results
    • Seems like at least part of the issue here is a timing issue when an existing RT feed has its corresponding schedule feed changed.
  • not_null_fct_observed_trips_vp_name.069061b91b - Got 6153 results
  • unique_fct_observed_trips_trip_instance_key.e5375de50e - Got 3054 results
  • dbt_utils_unique_combination_of_columns_int_gtfs_rt__service_alerts_trip_day_map_grouping_key__dt__service_date.d818d7c661 - Got 3 results
  • not_null_fct_observed_trips_tu_name.2e55990a62 - Got 5353 results
  • unique_fct_trip_updates_summaries_trip_instance_key.b39f628c7c - Got 2882 results
  • unique_fct_trip_updates_summaries_key.cb3b97a678 - Got 3864 results
Queries to see failures
with tbl AS (
  SELECT *
  FROM `cal-itp-data-infra`.`staging`.`int_gtfs_rt__trip_updates_trip_day_map_grouping`
),

validation_errors as (

    select
        key, dt, service_date, trip_schedule_relationship, trip_direction_id
    from tbl
    group by key, dt, service_date, trip_schedule_relationship, trip_direction_id
    having count(*) > 1

)

select validation_errors.*, tbl.* EXCEPT(header_timestamps_array, trip_update_timestamps_array, message_keys_array, extract_ts_array, message_ids_array, skipped_stops_array, scheduled_stops_array, canceled_stops_array,added_stops_array)
from validation_errors
left join tbl using (key, dt, service_date, trip_schedule_relationship, trip_direction_id)
ORDER BY validation_errors.key, validation_errors.dt, validation_errors.service_date
@lauriemerrell lauriemerrell self-assigned this Jul 14, 2023
@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268M

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268T

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268P

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268V

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268R

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268S

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268W

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268Q

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268E

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-268D

@cal-itp-sentry
Copy link

Sentry issue: CAL-ITP-DATA-INFRA-2685

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant