TechEcho
A Dive Into The Lending Club Data

84 points by ClementM over 9 years ago

9 comments

radmuzom over 9 years ago
I build statistical models for banks that help assess the risk of a loan. Effectively, my models get converted into the grades (A, B, C, D, etc.) mentioned in the article. The strategies (second chance, family guy, safe haven) are generally consistent with experience from the portfolios of most financial institutions.

However, I am skeptical (prove me wrong) of the statement in the article: "Lenders get a return on their investment that is typically much better than traditional Certificate of Deposit or Saving Accounts". In finance terms, I would be surprised if they have a higher RAROC [1] than large banks. If they really do, then congratulations (you will put banks out of business in a few years)?

[1] https://en.wikipedia.org/wiki/Risk-adjusted_return_on_capital
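To make the RAROC comparison concrete, here is a minimal sketch of one common simplified form of the ratio (risk-adjusted net income divided by economic capital). The function name, its parameters, and all figures are hypothetical illustrations, not numbers from Lending Club or any bank.

```python
def raroc(revenue, operating_costs, expected_loss, economic_capital):
    """Simplified RAROC: (revenue - costs - expected loss) / economic capital.

    This is one common textbook form; real bank models add further terms.
    """
    if economic_capital <= 0:
        raise ValueError("economic_capital must be positive")
    return (revenue - operating_costs - expected_loss) / economic_capital


# Hypothetical portfolio figures, chosen only to show the arithmetic.
example = raroc(
    revenue=120.0,          # interest and fee income
    operating_costs=20.0,   # servicing and platform fees
    expected_loss=40.0,     # expected credit losses
    economic_capital=400.0, # capital held against unexpected losses
)
print(f"RAROC: {example:.2%}")
```

The point of the ratio is that a headline yield says little until expected losses and the capital at risk are accounted for, which is why a raw comparison with a savings account can mislead.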
Amorymeltzer over 9 years ago
The employment length is really bugging me. I've always selected people with a few years at their current job, leaning towards higher, because it feels safe, but this says that <2 years of experience is better than longer! I wonder if they're "newer" and so more likely to stay around and not be pushed out, or if the rates are much higher relative to a marginal increase in risk. I'm leaning toward the latter. It looks like income has the same effect for home loans; <50k has a much higher return simply because they get a huge rating hit.

My other big hit is 3 versus 5-year terms. Anyone here care to comment? I like the 36 months because it feels more liquid, and when I started I wasn't sure LendingClub was going to be around for a decade or more. Beginning to think I should reconsider that stance.
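The "higher rate versus marginal increase in risk" trade-off can be sketched with a one-period expected-return calculation. This is a deliberately crude model (a single period, a fixed loss on default), and the rates and default probabilities below are hypothetical, not taken from the Lending Club data.

```python
def expected_return(rate, p_default, loss_given_default=1.0):
    """One-period expected return on one unit lent.

    With probability (1 - p_default) the lender earns `rate`;
    with probability p_default the lender loses `loss_given_default`.
    """
    return (1.0 - p_default) * rate - p_default * loss_given_default


# Hypothetical segments: shorter tenure gets a worse grade, hence a higher rate.
short_tenure = expected_return(rate=0.16, p_default=0.08)
long_tenure = expected_return(rate=0.11, p_default=0.06)

print(f"short tenure: {short_tenure:.4f}")
print(f"long tenure:  {long_tenure:.4f}")
```

Under these assumed numbers the riskier segment still comes out ahead, because the rate premium more than covers the extra defaults. Whether that holds in the real data depends on the actual default rates per segment.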
Charlesmigli over 9 years ago
Nice post. I see that you used dc.js; could you share some code?
fbenezit over 9 years ago
Thanks Clement for this beautifully simple dc.js dataviz. How long did you play with it before finding the pearl? Do you think there are yet other pearls to find in your tool?
rgn216 over 9 years ago
Very useful tool. Gives valuable insight on how to select filters in portfolio construction. If "the Pearl" were an existing product, I would definitely invest in it.
rocketcity over 9 years ago
This is great. I have been using LendingRobot lately. I will have to rethink some of my strategies based on what I see in this article.
cryoshon over 9 years ago
Hm, this has sparked my interest in lending via Lending Club. I'll check this out for sure.
akg_67 over 9 years ago
I operate an online crowd-lending analytics and automation platform, PeerCube (https://www.peercube.com). I have been analyzing both Lending Club and Prosper data for my institutional clients for almost 4 years now. While OP made a good first attempt at analyzing the data, the analysis suffers from two major shortcomings that I normally see from people getting started with data analysis.

1. Domain Knowledge: Novice analysts tend to put the data in a blender and see what comes out first, instead of building some preliminary knowledge and intuition about the domain. This is quite evident in OP's analysis and finding about annual income. A person familiar with the domain will ask the question "Why would a borrower with a high annual income borrow a small loan at a high interest rate?" This will right away raise flags about the risks of lending to such borrowers. OP will benefit from reading some of the publications (books, research) on credit scoring and modeling before diving deep into analyzing Lending Club data.

2. Data Exploration: Not spending enough time exploring the data can lead to erroneous conclusions like the second chance strategy. When Lending Club started issuing loans to borrowers with delinquencies and public records has a big impact on returns, as newer loans have not aged enough to show sufficient defaults.

> Watch for your average return (expected return), consistency of returns through time (risk), while making sure there is enough supply (liquidity) on the platform to deploy your strategy.

Time is not risk. You need to find a proper measure for risk. Also consider the negative kurtosis of the return distribution and its shape: frequent low positive returns but a few high negative returns.

> I considered that investors deploy and re-invest their money continuously on the platform and therefore own a portfolio with different 'vintages' of loans. The ROI that are computed reflect this, as they are average returns across vintages.

Reconsider this argument that the "average return across vintages" is representative of investor returns. Tip: look at loan volume across vintages as well as the re-investment pattern of a typical investor.

> Please also note that due to the low issuance volume in the early days of the platform, the returns computed for the pre-2010 period are much less reliable than the post-2010 returns.

Please don't do this. The data between 2006 and 2010 is the most valuable due to the business cycle we were in at that time. The data since 2010 tells nothing about how loans might perform in the future when the business cycle is not as good as it has been in the last few years.

OP will really benefit from re-evaluating his findings with a critical eye. I suggest gaining some domain knowledge, spending a lot of time just exploring the data before drawing definite conclusions, and focusing on distributions, correlations, and statistical significance.
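The tip about loan volume across vintages can be illustrated with a small sketch comparing a simple average of vintage returns with a volume-weighted one. The vintage years, returns, and volumes below are made up for illustration and are not Lending Club figures.

```python
# Hypothetical vintages: year -> (annualized return, dollars issued).
vintages = {
    2008: (-0.02, 1_000_000),
    2012: (0.06, 50_000_000),
    2014: (0.08, 200_000_000),
}


def simple_average(data):
    """Unweighted mean of vintage returns: every vintage counts equally."""
    returns = [ret for ret, _ in data.values()]
    return sum(returns) / len(returns)


def volume_weighted_average(data):
    """Mean of vintage returns weighted by the volume issued in each vintage."""
    total_volume = sum(volume for _, volume in data.values())
    return sum(ret * volume for ret, volume in data.values()) / total_volume


simple = simple_average(vintages)
weighted = volume_weighted_average(vintages)
print(f"simple average:          {simple:.4f}")
print(f"volume-weighted average: {weighted:.4f}")
```

With these assumed figures the two averages differ noticeably, because the small early vintage counts as much as the large recent ones in the simple mean. Neither number is "the" investor return; an individual's result also depends on when and how they re-invested.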
mikeskim over 9 years ago
Can you fully automate data-driven investment on these platforms?