HCCs: The Role of CDI and Risk Scores

By Frank Cohen, MPA
January 30, 2023

Predicting coding patterns using the HCC risk scores can be a valuable endeavor.

EDITOR’S NOTE: Longtime RACmonitor contributing correspondent Frank Cohen, a senior healthcare analyst, is sharing his thoughts on a recent study he conducted of Hierarchical Condition Categories (HCCs) that revealed for the first time the likelihood of over- and under-coding by providers.

You can use the following information to help mitigate the risks of engaging in overzealous clinical documentation integrity (CDI) projects for providers who may already be optimizing their diagnosis coding – while at the same time, identifying those physicians who most likely need your assistance.

Since 2000, the Centers for Medicare & Medicaid Services (CMS) has implemented a health-based risk adjustment model. The purpose is to estimate the relative health status of a given beneficiary. This is done by factoring in diagnosis codes associated to care received and certain demographic information about each beneficiary. In 2004, CMS created a new risk model, the Hierarchical Condition Categories (HCC) model, which adjusts Medicare capitation payments to Medicare Advantage (MA) organizations for the variation in health expenditure risk of enrollees in their plans.

In general, the higher the HCC risk score, the ”sicker” the patient, the higher the estimated costs, and subsequently, the higher the payment rate to the MA organization. Because a significant portion of the risk score was calculated based on the number of diseases or conditions (excluding ESRD, end-stage renal disease) reported for a patient in the form of ICD codes, proper coding became a core concern among MA organizations and providers. Basically, the higher the score, the more the MA organization gets paid, and so, all of a sudden, payors became interested in how providers were documenting and coding for patient visits – and not for the reasons one might think. This wasn’t to audit the provider, or to see whether their documentation supported the payment (as in a recoupment audit), but to see whether the provider documented and coded such that they were maximizing (or maybe optimizing) the risk score. Remember, higher risk score, higher payments. It’s not difficult to see the incentive here.

Out of this was born a niche market called CDI. According to the American Academy of Professional Coders (AAPC), “clinical documentation is the information a person responsible for a patient’s medical care enters in a medical record, which is a repository for an individual’s health information. The entries contained in the medical record may be authored by a physician, dentist, chiropractor, or other healthcare professional. Regulations, accreditation requirements, internal policies, and other rules may define who is allowed to document in the medical record in specific cases.”

They go on to define CDI as: “the process of reviewing medical record documentation for completeness and accuracy. CDI includes a review of disease process, diagnostic findings, and what the documentation might be missing.”

Reading the above paragraph, one could easily conclude, then, that CDI is a good thing, and CDI consultants are helping to improve the overall quality of care that a patient receives. But the reality is that there are some CDI consultants and programs that are more focused on improving payment than the quality of care. I know you are getting tired of reading this, but higher scores, higher payment. And a major component of the risk score calculation is the number of diseases or conditions reported in the patient chart. The reason I know this is a problem is the number of lawsuits the government has filed against providers, consultants, and MA organizations for abusing the HCC risk adjustment program. I did a cursory search and found 27 active lawsuits over this issue.

One of those involves a whistleblower lawsuit wherein the whistleblower (Kathy Ormsby) claimed that the organization (Palo Alto Medical Foundation and Sutter Health) was inflating the number of ICD codes reported in a patient’s record in order to increase the amount of payment under their MA plan. The Government stated that their investigation confirmed what Ormsby had claimed; the organization systematically added false diagnosis codes to the records of their patients. In fact, one audit showed that some 90 percent of all cancer diagnoses were invalid. That same audit also found that 96 percent of stroke diagnoses and 66 percent of diagnoses for fractures were invalid (or falsified).

Another big one was brought against UnitedHealth Group (UHG) by the Government for basically the same complaint: falsifying patient records to increase the risk score and subsequently increasing payments. In this September 2017 case, the Government alleged the following: “in particular, the lawsuit contends that UHG funded chart reviews conducted by HealthCare Partners (HCP), one of the largest providers of services to UHG beneficiaries in California, to increase the risk adjustment payments received from the Medicare program for beneficiaries under HCP’s care. According to the case laid forth by the DOJ (U.S. Department of Justice), UHG allegedly ignored information from these chart reviews about invalid diagnoses, and thus avoided repaying Medicare monies to which it was not entitled.”

So, how do you know whether a provider (or at least the patient charts) lean towards over-coding? The ideal method would be an audit. The problem is that you can’t audit all of a provider’s charts. One alternative is to draw a statistically valid random sample and then extrapolate the results to the universe of claims submitted by that provider. I’m all for that. But how do you know which providers to audit?

And this brings us back to the realm of risk-based auditing. Except in this case, rather than trying to determine the risk of an audit, we are trying to predict whether the provider’s charts accurately reflect the reality of the composite visits. The real question, then, is this: is there some way to estimate (or predict) whether a provider may be under-coding or over-coding, based on some benchmark; and the answer is “probably.” The following is what I did to test this.

First, I imported the data from the most recent Public Use File (PUF), which contains National Provider Identifier (NPI) numbers for over a million physicians in the United States (including Puerto Rico, Guam, and the Federated States of Micronesia). Then I did a whole bunch of filtering, and what I ended up with was a pretty solid database that contained, among other things, unique beneficiaries, total Medicare payments, and the average HCC score by provider. In total, I used these data for around 780,000 providers in 57 different specialties.

From these data, I created a table that reported, by specialty, the average risk score, along with the median standard deviation and inter-quartile range. I then created both a high- and a low-risk indicator for each of those specialties. While what I did was a bit more complicated, in general, I multiplied the standard deviation by two and then subtracted from the mean to get the low risk score, and added it to the mean to get the high risk indicator. Going back to the data set with all of the providers, I would simply test their average risk score (based on their specialty) against the high and low thresholds I created in the summary table. If that provider’s average risk score was below the lower threshold, I would tag that provider as a potential under-coder. If their average risk score was above the upper threshold, I would tag that provider as an over-coder. For example, for cardiology, the lower threshold is 1.139 and the upper threshold is 2.837. Picking a cardiologist at random from the data, I see that their average risk score (for their entire patient population) is 1.097. Since this is below the lower threshold (1.139), I would label this provider as a potential under-coder. For another provider, I get an average risk score of 3.235, which is higher than the upper threshold, and as such, I would label this provider a potential over-coder.

To confirm the validity of my findings, I compared those indicators against the percentile ranking for each of those providers. The reason I did this was because the above method is most accurate for a normal distribution, but the data points were not normally distributed. Note that the percentile rankings were also specialty-specific. I was actually a bit surprised at the results of this test, as those that I tagged as potential over-coders had percentiles that ranged from 85^th to 100^th, meaning that it would be reasonable to investigate those providers that met either criteria. For the lower threshold, however, the percentile range was far narrower: from 1^st to 9^th. My conclusion is that the upper test is likely more accurate than the lower test.

What I didn’t do was conduct actual chart audits on these providers, so I can’t say with any degree of certainty that a provider I call an over-coder is actually over-coding (and the same for what I call an under-coder). The value I see in this type of analysis is the ability to prioritize work effort by looking at those that are most likely (predicted) to be either under- or over-coding. Under-coders have the opportunity to improve their clinical documentation to increase their risk scores (legitimately), thereby increasing payments. For over-coders, a risk-based approach is needed to determine if they are, in fact, over-coding.

Whatever the approach, I continue to work toward means and methods that will help to improve the efficiency and accuracy with which healthcare organizations document, code, and bill for their services. And ultimately, I would think that our goals would align with the opportunity to not only improve under-coding docs, but to mitigate the risk that potential over-coding docs face with respect to Government and third-party audits. We have the technology (and sometimes the data) to achieve amazing results.

I guess in the end, it’s just a matter of how much effort we want to expend in the front end to mitigate damages in the back end. According to Benjamin Franklin, “an ounce of prevention is worth a pound of cure.”

And that’s the world according to Frank.

TAGS: Clinical Documentation, CMS, HCC, ICD-10, Medicare

Frank Cohen, MPA

Frank D. Cohen is Senior Director of Analytics and Business Intelligence at VMG Health, LLC, and is Chief Statistician for Advanced Healthcare Analytics. He has served as a testifying expert witness in more than 300 healthcare compliance litigation matters spanning nearly five decades in computational statistics and predictive analytics.

Featured Webcasts

Sepsis Sequencing in Focus: From Documentation to Defensible Coding

Sepsis sequencing continues to challenge even experienced coding and CDI professionals, with evolving guidelines, documentation gaps, and payer scrutiny driving denials and data inconsistencies. In this webcast, Payal Sinha, MBA, RHIA, CCDS, CDIP, CCS, CCS-P, CCDS-O, CRC, CRCR, provides clear guideline-based strategies to accurately code sepsis, severe sepsis, and septic shock, assign POA indicators, clarify the relationship between infection and organ dysfunction, and align documentation across teams. Attendees will gain practical tools to strengthen audit defensibility, improve first-pass accuracy, support appeal success, reduce denials, and ensure accurate quality reporting, empowering organizations to achieve consistent, compliant sepsis coding outcomes.

March 26, 2026

Fracture Care Coding: Reduce Denials Through Accurate Coding, Sequencing, and Modifier Use

Expert presenters Kathy Pride, RHIT, CPC, CCS-P, CPMA, and Brandi Russell, RHIA, CCS, COC, CPMA, break down complex fracture care coding rules, walk through correct modifier application (-25, -57, 54, 55), and clarify sequencing for initial and subsequent encounters. Attendees will gain the practical knowledge needed to submit clean claims, ensure compliance, and stay one step ahead of payer audits in 2026.

February 24, 2026

Mastering Principal Diagnosis: Coding Precision, Medical Necessity, and Quality Impact

Accurately determining the principal diagnosis is critical for compliant billing, appropriate reimbursement, and valid quality reporting — yet it remains one of the most subjective and error-prone areas in inpatient coding. In this expert-led session, Cheryl Ericson, RN, MS, CCDS, CDIP, demystifies the complexities of principal diagnosis assignment, bridging the gap between coding rules and clinical reality. Learn how to strengthen your organization’s coding accuracy, reduce denials, and ensure your documentation supports true medical necessity.

December 3, 2025

Proactive Denial Management: Data-Driven Strategies to Prevent Revenue Loss

Denials continue to delay reimbursement, increase administrative burden, and threaten financial stability across healthcare organizations. This essential webcast tackles the root causes—rising payer scrutiny, fragmented workflows, inconsistent documentation, and underused analytics—and offers proven, data-driven strategies to prevent and overturn denials. Attendees will gain practical tools to strengthen documentation and coding accuracy, engage clinicians effectively, and leverage predictive analytics and AI to identify risks before they impact revenue. Through real-world case examples and actionable guidance, this session empowers coding, CDI, and revenue cycle professionals to shift from reactive appeals to proactive denial prevention and revenue protection.

November 25, 2025

Featured Webcasts

Mastering MDM for Accurate Professional Fee Coding

In this timely session, Stacey Shillito, CDIP, CPMA, CCS, CCS-P, CPEDC, COPC, breaks down the complexities of Medical Decision Making (MDM) documentation so providers can confidently capture the true complexity of their care. Attendees will learn practical, efficient strategies to ensure documentation aligns with current E/M guidelines, supports accurate coding, and reduces audit risk, all without adding to charting time.

March 31, 2026

The PEPPER Returns – Risk and Opportunity at Your Fingertips

Join Ronald Hirsch, MD, FACP, CHCQM for The PEPPER Returns – Risk and Opportunity at Your Fingertips, a practical webcast that demystifies the PEPPER and shows you how to turn complex claims data into actionable insights. Dr. Hirsch will explain how to interpret key measures, identify compliance risks, uncover missed revenue opportunities, and understand new updates in the PEPPER, all to help your organization stay ahead of audits and use this powerful data proactively.

March 19, 2026

Top 10 Audit Targets for 2026-2027 for Hospitals & Physicians: Protect Your Revenue

Stay ahead of the 2026-2027 audit surge with “Top 10 Audit Targets for 2026-2027 for Hospitals & Physicians: Protect Your Revenue,” a high-impact webcast led by Michael Calahan, PA, MBA. This concise session gives hospitals and physicians clear insight into the most likely federal audit targets, such as E/M services, split/shared and critical care, observation and admissions, device credits, and Two-Midnight Rule changes, and shows how to tighten documentation, coding, and internal processes to reduce denials, recoupments, and penalties. Attendees walk away with practical best practices to protect revenue, strengthen compliance, and better prepare their teams for inevitable audits.

January 29, 2026

AI in Claims Auditing: Turning Compliance Risks into Defensible Systems

As AI reshapes healthcare compliance, the risk of biased outputs and opaque decision-making grows. This webcast, led by Frank Cohen, delivers a practical Four-Pillar Governance Framework—Transparency, Accountability, Fairness, and Explainability—to help you govern AI-driven claim auditing with confidence. Learn how to identify and mitigate bias, implement robust human oversight, and document defensible AI review processes that regulators and auditors will accept. Discover concrete remedies, from rotation protocols to uncertainty scoring, and actionable steps to evaluate vendors before contracts are signed. In a regulatory landscape that moves faster than ever, gain the tools to stay compliant, defend your processes, and reduce liability while maintaining operational effectiveness.

January 13, 2026

HCCs: The Role of CDI and Risk Scores

Frank Cohen, MPA

Related Stories

The Next Phase of Medical Record Expectations: How Evolving Review Standards and Digital Interoperability Will Reshape Physician Documentation

In the Crosshairs: Aetna’s Severity Policy

Leave a Reply

Featured Webcasts

Sepsis Sequencing in Focus: From Documentation to Defensible Coding

Fracture Care Coding: Reduce Denials Through Accurate Coding, Sequencing, and Modifier Use

Mastering Principal Diagnosis: Coding Precision, Medical Necessity, and Quality Impact

Proactive Denial Management: Data-Driven Strategies to Prevent Revenue Loss

Trending News

Telehealth Coverage Extended, Meaning the Need to Get it Right Becomes Key

Trust, But Verify – The Pitfalls of Relying Too Heavily on Experts

Happy Mardi Gras – “Laissez les bons temps rouler!”

Featured Webcasts

Mastering MDM for Accurate Professional Fee Coding

The PEPPER Returns – Risk and Opportunity at Your Fingertips

Top 10 Audit Targets for 2026-2027 for Hospitals & Physicians: Protect Your Revenue

AI in Claims Auditing: Turning Compliance Risks into Defensible Systems

Trending News

The Elimination of the Inpatient-Only List: Why It Matters

What OIG’s New Medicare Advantage Guidance Means for CDI, Coding, and Physician Advisors

Docs Take a Hit: What is CMS Signaling About Physician Work Valuation

Heart Month 2026: Letter From The Publisher

Stay Connected

News

Account

Info