TechEcho

7 comments

wjn0almost 7 years ago

I found this line confusing:> The printed lines above show that both algorithms capture more than 50% of the variance exhibited in the data using only 4 of the 50 stocks.Based on the sklearn PCA documentation [1] this has nothing to do with the coefficients on individual stocks, and for PCA should read more like: "[...] capture more than 50% of the variance exhibited in the data using only 4 components [...]" which is not the same thing.1. <a href="http://scikit-learn.org/stable/modules/generated/sklearn.decomposition.PCA.html" rel="nofollow">http://scikit-learn.org/stable/modules/generated/sklearn.dec...</a>

评论 #17750182 未加载

评论 #17750202 未加载

samfisher83almost 7 years ago

Does it even makes sense to run PCA on the change percentage of a stock. To me it would be make more sense to use it with physical properties of the under lying the company. PCA helps you reduce dimensions of a higher order dimension to lower dimension so you can group stocks together. I am a little confused by what the author is trying to do.

评论 #17750036 未加载

评论 #17751040 未加载

thanatropismalmost 7 years ago

I wish people were better acquainted with the literature, e.g. <a href="https://www.nowpublishers.com/article/Details/ECO-002" rel="nofollow">https://www.nowpublishers.com/article/Details/ECO-002</a>(Ed: yeah, that's just a sample of the book but has a large bibliography at the end.)

rubatugaalmost 7 years ago

I can't seem to make the COD reach 1.0<pre><code> >>> selector.ordered_cods [0.43298218, ... , 0.5068577, 0.5068577] </code></pre> Would you think this a problem/bug?

评论 #17753811 未加载

squigs25almost 7 years ago

Another technique for unsupervised feature selection is Principal Feature Analysis (PFA): <a href="http://venom.cs.utsa.edu/dmz/techrep/2007/CS-TR-2007-011.pdf" rel="nofollow">http://venom.cs.utsa.edu/dmz/techrep/2007/CS-TR-2007-011.pdf</a>

octopodalmost 7 years ago

This dataset could be interesting as it consists of stocks and cryptos <a href="https://vectorspace.ai/recommend/datasets" rel="nofollow">https://vectorspace.ai/recommend/datasets</a>

closedalmost 7 years ago

This title seems a bit confusing, since PCA is a form of unsupervised feature selection (or rather, feature weighting).The title seems like it has the form "<Specific method> vs <Broader category method fits in>".

评论 #17751625 未加载

7 comments

wjn0almost 7 years ago

评论 #17750182 未加载

评论 #17750202 未加载

samfisher83almost 7 years ago

评论 #17750036 未加载

评论 #17751040 未加载

thanatropismalmost 7 years ago

rubatugaalmost 7 years ago

I can't seem to make the COD reach 1.0<pre><code> >>> selector.ordered_cods [0.43298218, ... , 0.5068577, 0.5068577] </code></pre> Would you think this a problem/bug?

评论 #17753811 未加载

squigs25almost 7 years ago

octopodalmost 7 years ago

This dataset could be interesting as it consists of stocks and cryptos <a href="https://vectorspace.ai/recommend/datasets" rel="nofollow">https://vectorspace.ai/recommend/datasets</a>

closedalmost 7 years ago

评论 #17751625 未加载

Linear compression in Python: PCA vs unsupervised feature selection

7 comments

Linear compression in Python: PCA vs unsupervised feature selection

7 comments