These are the reasons why the most sophisticated Machine Learning techniques may not perform well.Based on the paper titled “Classifier Technology and the Illusion of Progress” by David Hand.Aug 29, 2022Aug 29, 2022
Intuition of the Delta Method in StatisticsDelta method is a useful technique when we are interested in the asymptotic probability distribution of a function of random variables. In…Mar 24, 2022Mar 24, 2022
The difference between mathematical, computational, and applied statisticsDifferent subdomains require different skillsets. It is important to make an informed decisions.Mar 4, 2022Mar 4, 2022
Published inAnalytics VidhyaData Analysis: It is not as hard as it may seemBe a data analyst first and then focus on programming.Mar 3, 2022Mar 3, 2022
Published inAnalytics VidhyaTechnique initially developed for astronomy to analyze sports data.Statistical testing for sports dataSep 4, 2021Sep 4, 2021
Published inAnalytics VidhyaSome considerations for data visualization in professional settingsMajority of data visualization work in professional settings are not necessarily spectacular. One should consider a few other things.Jan 20, 2021Jan 20, 2021
Published inAnalytics VidhyaUsing spatial and activities data(with python) to understand how Messi and Ronaldo do their magic…Sports analytics have seen rapid rise in recent years due to availability of large amount of data. Books and movies like Moneyball have…Jan 11, 2021Jan 11, 2021
Permutation based feature importance for clusteringFeature selection is an important topic in machine learning. It is relevant for both supervised and unsupervised problems. At this moment…Jan 6, 2021Jan 6, 2021
Why feature selection in clustering is importantKnowingly or unknowingly we deal with groups all the time when we work with datasets. Companies do it when they want to segregate their…Dec 28, 2020Dec 28, 2020
Published inAnalytics VidhyaData wrangling project with PythonDataset: Spatio-temporal_match_events_in_soccer_competitionsDec 27, 20201Dec 27, 20201