Project Notebooks

My projects are organized into following notebooks.

img statistics1

Statistics - The intuitive way

The impressive and complicated models frequently showcased as triumph of cutting edge advancement in AI and computer science on the heart relies on the principles and theorems developed in statistics and mathematics. Therefore one of my top priority projects is to have clear idea on concepts like sampling distribution, hypothesis testing, analysis of variance and experimental design.

 Mar 13, 2020

#Statistics


Read more
Read less

Statistics

Troy Data Science meme

  One of the struggles I had while learning statistic was to build intuitive thinking about fundamental concepts of statistics like inference, hypothesis testing, and probability distribution. This is where I adopted the, learn by experimenting method. It is really hard to build intuition just by seeing one or two datasets presented in a textbook. Being a student of science, I wanted to experiment every statement and theories discussed in a textbook. The best tool I found to do so is R, which is basically a metaverse for statistics. Every theories in statistics can be tested in R by generating some dummy data. It has support for any type of distribution and we can simulate any kind of sample. Doing so helped me skip theoretical proof of theorems in statistics, but still get clear picture of what the equation is doing. I can generate thousands of samples in R and see the distribution of sample means and variance which will be very close to the one given by the mathematical equation. This approach helped me a lot to learn both statistics and R. So, I want to share my notes with everyone who wants to learn statistics with minimum mathematics. All my notes on category of Statistics are my attempts to explain each topics with as much experiment as possible.

  1. Principle of inference
  2. Mean and Variance
  3. Permutation and combination
  4. Probability theory
  5. Statistical distribution
    5.1 Binomial distribution
    5.2 Normal Distribution (t-distribution, \(\chi^2\), F- distribution)
  6. Hypothesis testing
  7. ANOVA (One-way and Factorial)
  8. Regression (Simple, multiple)
  9. Degree of freedom
Read less
img Rstudio

Visualization in R

Re-visualize some famous plots using ggplot2

 Apr 23, 2020

#R-programming


Read more
Read less

GGplot2 for visualization

Data visualization can be done by using multiple tools like excel, ggplot, js or other. GGplot is one of the most flexible and powerful tool.

Read less
img python

Learning Python for R users

Tidy way to learn python

 May 27, 2021

#Python


Read more
Read less

Background

Different people start learning to code at different time point in there career with different goals. I myself had started coding because …

Read less
img bioinformatics

Bioinformatics projects

DGCA, MEGENA, and rna seq

 Jun 8, 2021

#Bioinformatics


Read more
Read less

These are the there projects I am working on now.

  1. DGCA
  2. MEGENA
  3. EDGER
Read less