Is cash-flow based decisioning finally ready to turn the corner?
In one of my previous posts, I discussed the basic framework of doing underwriting - the data, the model, the policy, the infrastructure. Since then I have gotten a lot of questions around one specific topic - cash flow based underwriting (sometimes used interchangeably with the term open-banking) and its potential. I recently also had the opportunity to attend a symposium hosted by Nova Credit on the same topic and engage in a robust exchange of ideas and views from industry experts and practitioners (thank you Misha Esipov and Ellyse Q. ). In this month's edition of Coffee and Credit, I wanted to recap and share some of my POVs on this topic.
So what is cash flow based underwriting or decision-making?
At its core, a successful lending business for both consumer and SME lending usually rests on successfully predicting i) ability to pay, ii) willingness to pay and iii) and future stability of cash flow. Historically lending businesses have relied heavily on bureau data for underwriting. As you can imagine, when a customer has a reasonably long history with the bureaus (typically >2 years), the data is pretty accurate in predicting willingness to pay. Basically, given enough length of history, a customer or SME's past pattern of how they have handled loan payments is usually a good predictor of how they are going to behave in the future. However, accurately predicting ability to pay is an area where many lenders face gaps. Historically lenders have used self reported income combined with a picture of debt obligations from the bureaus, plus any other self reported obligations (like alimony payments, child support etc) to calculate ability to pay. For some higher ticket loans (like mortgages), there may be a more involved (albeit high-friction) doc-upload process to verify income. They also factor in the bureau data with the implicit assumption that if you had a stellar history of meeting your debt obligations in the past, you will likely also be good on your ability to pay. As you can see, all of these approaches have gaps. Not to mention, most lenders have limited or no line of sight to the stability, frequency and size of future cash flow. Enter cash-flow data! Having an accurate picture of a borrower’s cash flow - how much money is coming in, how much is going out, the destination, the frequency, the timing etc can be huge in determining ability to pay and stability of income and can help in improving both direct and marginal lending decisions, especially when you also consider that even when available, bureau data is very ‘laggy’. When used effectively, cash-flow data can be used in all types of lending decisions - approve/decline, determining the size of the loan, line assignment, pricing, account management policies like credit limit management, payment hold strategies etc. In my experience, when this data is available, it has definitely proved to be a powerful risk splitter.
So why hasn’t cash-flow based underwriting and decisioning really taken off?
Despite its seeming usefulness, less than 5% of all underwriting decisions use true automated cash flow data. In his most recent post (a must read), Jason Mikula also talks about the much broader point of how lending, despite all the tall claims about the emergence of alternate data, machine learning etc, still seems to overwhelmingly rely on bureau data for underwriting for the most part. Why is that? To understand this, let’s first start by painting a picture of the lending landscape as it stands today with various types of FIs and their relationship to cash-flow data.
So let’s dive a bit deeper into the specific challenges that FIs have to deal with when it comes to cash flow data
Data - Getting cash flow data to build models is not easy - why? Because unlike bureau data where you can go to bureaus and get an anonymized data set per your specifications and then use that to build your V.1 model, you can’t do that for cash flow. There is currently not a central repository (at least not at scale yet; though it is starting to get addressed by the likes of Akoya , Financial Data Exchange , Early Warning® etc). Also, any effort to validate or retro-score a model post build is very difficult with cash flow data. Secondly, unlike the bureau data, cash flow data is not standardized yet. This creates challenges in data ETL and running models on the data when you consider that most of the underwriting and decision-making infrastructure today is built to ingest a standard format typically from the bureaus.
Customer Experience - In the age of speed and instant gratification, asking customers to provide bank data, even simple log-in credentials, creates a lot of friction for most of the loan applications leading to drop-off in the funnel. Also, unlike bureau data where once you build the pipes and have seamless and ongoing access to the customer’s bureau data for portfolio management decisions in the future, you simply cannot do that with cash-flow data without causing significant friction.
Recommended by LinkedIn
Policies - Building policies around cash flow data may create some unintended consequences. As one of my peers recently mentioned - seniors have limited cash flow but high assets. A policy that exclusively looks at cash flow scores may inadvertently discriminate against seniors.
Infrastructure - Current data and decision infrastructures are based on bureau data and ingesting a completely orthogonal data source and integrating it with an existing infrastructure creates its own set of challenges.
And finally, the ultimate catch 22 - The Economics - Big banks have the data and the infrastructure to use cash-flow underwriting effective today and they also have a full suite of products to cross-sell and up-sell. But for them, the economics of investing in the infrastructure and product development to support segments like new to credit, recent immigrants who tend to benefit most from cash flow underwriting do not make sense - not to mention come with a lot of reputational risks. These underserved segments tend to be constantly in the radar of regulators and public officials and any missteps howsoever unintended, can cause severe damage to the reputation. On the other hand, while smaller banks and non-banks focus on serving this segment, they do not have easy access to the bank data and/or do not have the full suite of products to cross sell / upsell eventually to justify the economics.
So what does the future of cash flow based underwriting and decisioning looks like:
In the short term the cash flow will continue to see use in the following areas:
In the long term though, I see a few things happening - mostly leading to the wider and pervasive use of cash-flow data:
Suffice to say, I am bullish about cash-flow data and optimistic that in the next few years, it will become more mainstream. Companies like Nova Credit and Prism Data who have not only invested in building cash flow based scores, attributes and solutions but are also rapidly expanding their partnership with FIs on one end and third party platform providers on the other side are going to be at the forefront of making this transition. It feels like the stars are finally starting to align. If you have or are using cash flow data, what has been your experience - what are the challenges you have faced and do you agree that the next few years will see cash-flow based underwriting becoming more mainstream?
Product & Analytics Leader
1yCash flow based decisioning especially feels like a great tool to get more of the scoreless and underbanked into the system, given that a checking account usually has a lower barrier to entry for the underserved vs a lending product. I am enjoying Coffee and Credit a lot Dipanjan ‘DD’ Das - looking forward to your next post!