Date Number Title
February 1, 2021, 10:44 AM PST 1085 [PA] backfill total tests historical data from new state data source
February 1, 2021, 10:44 AM PST 1085 [PA] backfill total tests historical data from new state data source
November 6, 2020, 9:37 AM PST 933 [PA] Patch 11/3-11/5 cur. icu
September 16, 2020, 8:01 AM PDT 839 [PA] Explicit Probables Backfill
September 16, 2020, 7:34 AM PDT 838 [PA] Incorrect Positive (PCR) Cases value on 5/25
September 4, 2020, 1:28 PM PDT 814 [PA] Move total tests historic time-series to `Total Tests PCR (People)`
August 17, 2020, 9:03 PM PDT 773 [PA] SVP Fixes: Move total test values from "Total PCR Tests (People)" to "Total Tests (PCR)" and backfill Totals to 3/11
July 30, 2020, 8:05 AM PDT 705 [PA]: PCL Cases Historical
July 13, 2020, 10:52 AM PDT 618 [PA] Case counts after antibody testing reporting ended on 6/11
May 2, 2020, 8:40 PM PDT 347 PA's Positive counts for April 22-23 may be incorrect
April 29, 2020, 4:12 AM PDT 325 PA deaths neagtive on 4/23
April 24, 2020, 9:24 PM PDT 291 [States Daily CSV] PA deathIncrease is VERY negative for 2020-04-23
April 23, 2020, 3:06 PM PDT 274 [States Daily CSV] PA deaths is lower 2020-04-23 than 2020-04-22
April 2, 2020, 6:20 PM PDT 126 PA cumulative hospitalized overcounting
March 29, 2020, 12:01 PM PDT 76 PA: cumulative hospitalizations

#1085: [PA] backfill total tests historical data from new state data source

Issue number 1085

theomichel opened this issue on February 1, 2021, 10:44 AM PST

Labels Backfill Data quality

Open in Github

Target Date: This week Overview: backfill up to most recent date and let daily DE continue.

Comments

space-buzzer commented on February 1, 2021, 5:12 PM PST
Open in Github

#1085: [PA] backfill total tests historical data from new state data source

Issue number 1085

theomichel opened this issue on February 1, 2021, 10:44 AM PST

Labels Backfill Data quality

Open in Github

Target Date: This week Overview: backfill up to most recent date and let daily DE continue.

Comments

space-buzzer commented on February 1, 2021, 5:12 PM PST
Open in Github

#933: [PA] Patch 11/3-11/5 cur. icu

Issue number 933

hmhoffman opened this issue on November 6, 2020, 9:37 AM PST

Open in Github

State: PA

Dates affected: 11/3-11/5

Describe the issue: As of 11/3, PA reports patients currently in the ICU for COVID-19 in their press releases(PRs). We began capturing this metric on 11/6. We should patch cur. icu data for 11/3-11/5 from press releases.

Comments

hmhoffman commented on November 6, 2020, 9:40 AM PST
Open in Github

BEFORE: Screen Shot 2020-11-06 at 12 38 59 PM

AFTER: Screen Shot 2020-11-06 at 12 40 13 PM

#839: [PA] Explicit Probables Backfill

Issue number 839

jesseandersonumd opened this issue on September 16, 2020, 8:01 AM PDT

Labels Backfill Historical Data

Open in Github

State or US: PA

Describe the problem PA started providing probables and confirmed values on 5/22. CTP started tracking confirmed values on 5/22 but started reporting probables on 7/28. We can backfill using positive - positive PCR cases and screenshots to have probables back until 5/22.

Sources: https://covidtracking.com/data/state/pennsylvania/screenshots

NEW VALUES FOR PROBABLES (5/22-9/15):

4264 4221 4182 4182 4126 4074 4049 4014 4001 3992 3982 3931 3897 3859 3834 3814 3801 3778 3741 3726 3701 3685 3652 3641 3627 3602 3576 3544 3529 3490 3474 3454 3438 3390 3391 3356 3355 3347 3337 3310 3280 3252 3229 3185 3174 3157 3141 3120 3080 3053 3036 3024 2993 2969 2950 2913 2890 2866 2863 2853 2827 2814 2792 2765 2727 2714 2688 2670 2665 2633 2608 2599 2587 2562 2536 2501 2491 2476 2459 3540 2414 2414 2396 2381 2349 2327 2325 2319 2311 2287 2275 2253 2238 2231 2225 2199 2814 2782 2758 2736 2733 2727 2713 2679 2661 2646 2620 2630 2607 2574 1938 1902 1858 1839 1807 1774 1707

Comments

jesseandersonumd commented on September 16, 2020, 8:04 AM PDT
Open in Github

BEFORE: Screen Shot 2020-09-16 at 11 03 21 AM

jesseandersonumd commented on September 16, 2020, 8:04 AM PDT
Open in Github

AFTER: Screen Shot 2020-09-16 at 11 03 52 AM

the-daniel-lin commented on September 24, 2020, 11:26 AM PDT
Open in Github

DC'd by DZL 9/24 14:26

#838: [PA] Incorrect Positive (PCR) Cases value on 5/25

Issue number 838

jesseandersonumd opened this issue on September 16, 2020, 7:34 AM PDT

Labels Data quality

Open in Github

State or US: PA

Describe the problem We reported 68186 positive (PCR) cases on 5/25. This is incorrect according to screenshots: https://covidtracking.com/screenshots/PA/PA-20200525-184018.png.

Link to data source

  • https://covidtracking.com/screenshots/PA/PA-20200525-184018.png

Comments

jesseandersonumd commented on September 16, 2020, 7:50 AM PDT
Open in Github

BEFORE: Screen Shot 2020-09-16 at 10 49 28 AM

jesseandersonumd commented on September 16, 2020, 7:50 AM PDT
Open in Github

AFTER: Screen Shot 2020-09-16 at 10 49 40 AM

#814: [PA] Move total tests historic time-series to `Total Tests PCR (People)`

Issue number 814

muamichali opened this issue on September 4, 2020, 1:28 PM PDT

Labels Data quality stale

Open in Github

State or US: Pennsylvania

Describe the problem Pennsylvania clarified that their negatives number is in people units, so we are moving the total tests time-series to Total Tests PCR (People)

Link to data source Will add

Comments

muamichali commented on September 4, 2020, 1:32 PM PDT
Open in Github

States Before image

States After image

Contentful After image

States Daily Before image

stale[bot] commented on September 19, 2020, 2:12 PM PDT
Open in Github

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions!

stale[bot] commented on September 29, 2020, 2:28 PM PDT
Open in Github

This issue has been closed because it was stale for 15 days, and there was no further activity on it for 10 days. You can feel free to re-open it if the issue is important, and label it as "not stale."

#773: [PA] SVP Fixes: Move total test values from "Total PCR Tests (People)" to "Total Tests (PCR)" and backfill Totals to 3/11

Issue number 773

jaclyde opened this issue on August 17, 2020, 9:03 PM PDT

Labels Data quality Historical Data stale

Open in Github

State: PA

Dates impacted: 8/17 - 3/11

Issues: PA report positive cases and negatives. We add these values to get Total Tests, however the units of the negative tests is unclear. We are moving the Total Test values from "Total PCR Tests (People)" to "Total Tests (PCR)."

Additionally, we are backfilling the Total Tests (PCR) metric from 3/11-5/17 using the positive case values plus the negative case values we had already recorded. In doing the backfill we noticed that the total had been carried over from 5/12-5/17. During this period we were unsure what was included in PA's case number, the backfill includes replacing that carried over value with the value that matches positive+negative.

Comments

jaclyde commented on August 17, 2020, 10:37 PM PDT
Open in Github

BEFORE MOVE PA Before AFTER MOVE PA After Move

jaclyde commented on August 17, 2020, 10:46 PM PDT
Open in Github

BEFORE BACKFILL PA Before Backfill AFTER Screen Shot 2020-08-17 at 10 44 38 PM

Backfill Values: 332904 327836 319832 311195 302869 295980 288858 284383 277107 270559 262788 256340 250882 245590 240641 235376 227448 221365 214883 209088 203422 198593 192935 186143 179114 171956 166851 162952 158854 153965 147373 141470 137584 133631 129792 124890 120153 113019 105602 98538 91278 83854 77771 70030 62115 54714 48232 42488 37864 33455 28005 23234 18128 12320 9494 7239 5443 4137 2842 1793 1320 975 746 268 252 181 138 104

stale[bot] commented on September 1, 2020, 11:56 PM PDT
Open in Github

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions!

#705: [PA]: PCL Cases Historical

Issue number 705

qpmnguyen opened this issue on July 30, 2020, 8:05 AM PDT

Labels Backfill Historical Data PCL/SVP Historicals not stale

Open in Github

State or US: PA

Describe the problem On 5/22, PA started breaking down total cases by confirmed and probable status. However, the value for confirmed cases was not recorded on 5/22. Using the 6pm screenshot to backfill

Link to data source Screenshots from the date (https://covid-tracking-project-data.s3.us-east-1.amazonaws.com/state_screenshots/PA/PA-20200522-184845.png)

Comments

qpmnguyen commented on July 30, 2020, 8:07 AM PDT
Open in Github

Fixing now... BEFORE: image AFTER: image

#618: [PA] Case counts after antibody testing reporting ended on 6/11

Issue number 618

brianskli opened this issue on July 13, 2020, 10:52 AM PDT

Labels Historical Data stale

Open in Github

After 6/11, PA stopped reporting antibody testing, of which positive values were included in total case calculations. A user contacted us about how that 620 was carried over/used subsequently, particularly on 6/12, when the state's bar graph showed an increase of 686 cases, but CTP showed an increase of only 66.

Front logo Front conversations

Comments

stale[bot] commented on July 28, 2020, 11:05 AM PDT
Open in Github

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions!

stale[bot] commented on August 13, 2020, 7:46 AM PDT
Open in Github

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions!

stale[bot] commented on August 23, 2020, 8:41 AM PDT
Open in Github

This issue has been closed because it was stale for 15 days, and there was no further activity on it for 10 days. You can feel free to re-open it if the issue is important, and label it as "not stale."

#347: PA's Positive counts for April 22-23 may be incorrect

Issue number 347

sgdoerfler opened this issue on May 2, 2020, 8:40 PM PDT

Open in Github

Comparing the data shown in the Positive column here to the numbers shown in the accompanying screen shots, it appears the Positive column represents Pennsylvania's total case counts, including both confirmed and probable cases.

However, on April 22nd, the table uses the figure 35045, which includes only confirmed cases, according to the screen shots, not probables. The correct figure that includes probables, and would match the other dates, is 35684. Both numbers appear on the screen shot. It appears the site just used the wrong one.

Then the next day, April 23rd, the table shows a Positive value of 36647. This doesn't seem to appear on any of the screen shots for that day. The figure that does is 37053. So I think the Positive value for April 23rd also needs correcting.

Comments

muamichali commented on May 3, 2020, 5:24 AM PDT
Open in Github

Thanks for reporting this @sgdoerfler

We updated the positives number for 4/22 and 4/23 after checking the screenshots. You should see it reflected on the site this afternoon.

4/22 PA Screenshot Evening 4/23 PA Screenshot Evening

Before Screenshot_2020-05-03 Coronavirus numbers by state (CovidTracking, DEV COPY)

After Screenshot_2020-05-03 Coronavirus numbers by state (CovidTracking, DEV COPY)(1)

#325: PA deaths neagtive on 4/23

Issue number 325

ryansarver opened this issue on April 29, 2020, 4:12 AM PDT

Labels Historical Data

Open in Github

It appears total deaths goes down by over 200 on 4/23 for Pennsylvania.

Comments

sgdoerfler commented on May 2, 2020, 7:38 PM PDT
Open in Github

The state revised their numbers. According to Lehigh Valley Live (and many other newspapers), reporting on April 23rd 2020:

The unusual change in the Pennsylvania Department of Health’s daily report is due to its inclusion of probable cases in the death count earlier this week. Per CDC guidelines, those are instances where COVID-19 is listed as a cause or contributing factor in a death where the patients had not been tested, but did have symptoms and a connection to a confirmed case.

After adding probable cases to the death toll two days ago, Health Secretary Rachel Levine on Thursday said that a number of those deaths require more information before they can be linked with the pandemic virus. She said they were removed from the count in the interest of transparency, citing the difficulty epidemiologists have handling these thousands of cases.

That resulted in the statewide coronavirus death toll dropping by 201.

#291: [States Daily CSV] PA deathIncrease is VERY negative for 2020-04-23

Issue number 291

acobolew opened this issue on April 24, 2020, 9:24 PM PDT

Open in Github

[States Daily CSV] Pennsylvania deathIncrease is VERY negative for 2020-04-23

library(data.table) covidtracking.dt <- fread('https://covidtracking.com/api/v1/states/daily.csv') covidtracking.dt[, date := as.Date(as.character(date), '%Y%m%d')] covidtracking.dt[order(date, decreasing=FALSE)][ , .(date, state, death, deathIncrease) ][deathIncrease < 0] date state death deathIncrease 1: 2020-04-05 VA 51 -1 2: 2020-04-07 DC 22 -2 3: 2020-04-22 AR 42 -1 4: 2020-04-23 PA 1421 -201 5: 2020-04-24 KS 111 -1 6: 2020-04-24 TN 168 -2

Comments

muamichali commented on April 26, 2020, 1:50 PM PDT
Open in Github

Hi @acobolew

Thanks for reporting.

As per a press conference on 4/23, "Death count: 1,421 total COVID-19 deaths, a decrease of 201 than was reported yesterday -- this reflects that the probable deaths had been removed from the count because of further investigating."

#274: [States Daily CSV] PA deaths is lower 2020-04-23 than 2020-04-22

Issue number 274

hrushka opened this issue on April 23, 2020, 3:06 PM PDT

Open in Github

Source: https://covidtracking.com/api/v1/states/daily.csv

PA, 2020-04-22: 1,622 PA, 2020-04-23: 1,421

Reference: https://c19.dev/d/4S5AN4CZz/covid-19-us-state-data?orgId=1&from=now-45d&to=now&var-state=PA&var-population=12801989&var-density=475.7437637476716&var-total_cases=36763&var-total_deaths=1720

Comments

beachhockey commented on April 24, 2020, 2:59 PM PDT
Open in Github

Even 4/24 is still lower than 4/22 with only 1,492 deaths reported.

hrushka commented on April 24, 2020, 4:39 PM PDT
Open in Github

Looks like the reason is that the deathIncrease for 2020-04-23 is negative

sgdoerfler commented on May 2, 2020, 8:27 PM PDT
Open in Github

This should be merged with issue 325, where I added an explanation already.

stale[bot] commented on May 17, 2020, 8:54 PM PDT
Open in Github

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions!

karaschechtman commented on May 20, 2020, 6:42 AM PDT
Open in Github

Hi @hrushka, Thanks for flagging. See issue 325 for an explanation. We are making a public note of the revision.

#126: PA cumulative hospitalized overcounting

Issue number 126

jdmaresco opened this issue on April 2, 2020, 6:20 PM PDT

Labels Data quality

Open in Github

From @webmasterkai

The hospitalizedCummulative data for PA goes down from 20200329 to 20200330, then does not exceed the 20200329 value again until today.

The higher, earlier numbers are not correct. We should adjust based on screen captures

image

Comments

muamichali commented on April 3, 2020, 2:44 PM PDT
Open in Github
PA Hosp Before Screen Shot 2020-04-03 at 5 38 22 PM Screen Shot 2020-04-03 at 5 38 57 PM Screen Shot 2020-04-03 at 5 39 33 PM PA Hosp After

I updated States Daily based on the screenshots.

#76: PA: cumulative hospitalizations

Issue number 76

meetar opened this issue on March 29, 2020, 12:01 PM PDT

Labels Data quality Data source stale

Open in Github

PA cumulative hospitalizations were calculated on 3/27 based on percentages on: https://www.health.pa.gov/topics/disease/coronavirus/Pages/Cases.aspx

Using a formula in the PA tab: https://docs.google.com/spreadsheets/d/1MvvbHfnjF67GnYUDJJiNYUmGco5KQ9PW0ZRnEP9ndlU/edit#gid=1393227010

KP left a private note:

(3/27 KP) used a messy dot product to calculate hospitalization numbers; see PA tab and ask me, JD, Elliot, or Jeff if needed

As well as an item in the #Reporting wishlist:

The percentages hospitalization data are crazy. Absolute numbers for hospitalization would be very helpful

On 3/29 midday I found that the same page appears to be reporting hospitalizations directly. That number is about half of the calculated number, and is ~corroborated by multiple news articles: https://www.ydr.com/story/news/2020/03/28/coronavirus-hospitalizations-pa-up-316-nearly-all-patients-icu/2933307001/ https://www.pennlive.com/coronavirus/2020/03/update-on-coronavirus-in-pennsylvania-watch-gov-wolf-dr-levine-news-conference.html

After conversation with KP and JH, they suggest that the reported cumulative number be used instead of the calculated number, though that will result in a decrease in the hospitalizations figure. https://covid-tracking.slack.com/archives/CUQ4MMTPD/p1585507764070700?thread_ts=1585504370.051800&cid=CUQ4MMTPD

Comments

meetar commented on March 29, 2020, 12:03 PM PDT
Open in Github

Also from KP:

For what it's worth, if we do the dot product calculation with the numbers on their site, we get 846 hospitalized instead of the 353 they claim.

stale[bot] commented on May 4, 2020, 6:20 PM PDT
Open in Github

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions!

stale[bot] commented on May 14, 2020, 7:30 PM PDT
Open in Github

This issue has been closed because it was stale for 15 days, and there was no further activity on it for 10 days. You can feel free to re-open it if the issue is important, and label it as "not stale."