In this **data science online training** you will learn statistics from the basics to advanced topics, how to program in R and Python, and how to use both languages for effective data analysis. You will learn how to install and configure the software needed for a statistical programming environment, and how generic programming language concepts are implemented in a high-level statistical language.

The **data science online training in Hyderabad** covers practical issues in statistical computing, including programming in R and Python, reading data into R and Python, accessing R packages and Python data science libraries and frameworks, writing R and Python functions, and debugging and profiling R and Python code. Topics in statistical data analysis provide working examples.

- Non-IT Professionals
- Developers
- Non-BI Professionals
- Data Analysts
- Project Managers
- Job seekers
- Graduates

Career IT Trainings provides the software and tools needed for the **data science online training**.

The **data science online training** course has no prerequisites. No prior knowledge of statistics, the R language, Python, or analytic techniques is required. The course covers statistics and machine learning techniques from basic to advanced.

- Introduction to Data Science, Tables, Database, ETL, EDW and Data Mining
- What is Data Science?
- Popular Tools
- Role of Data Scientist
- Analytics Methodology

Statistics is concerned with the scientific method by which information is collected, organized, analyzed and interpreted for the purpose of description and decision-making.

**Descriptive Statistics** - Deals with the presentation of numerical facts, or data, in table or graph form, and with the methodology of analyzing the data.

**Inferential Statistics** - Involves techniques for making inferences about the whole population on the basis of observations obtained from samples.
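
The distinction between the two branches can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the `sample` values are hypothetical, and the confidence interval assumes a normal approximation rather than any method prescribed by the course.

```python
import statistics
from statistics import NormalDist

# Hypothetical sample: exam scores drawn from a larger population.
sample = [62, 71, 68, 75, 80, 66, 73, 69, 77, 70]

# Descriptive statistics: summarize the sample itself.
mean = statistics.mean(sample)
median = statistics.median(sample)
stdev = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)

# Inferential statistics: estimate the population mean from the sample.
# A normal approximation is assumed here; for a sample this small,
# a t-distribution would give a slightly wider interval.
z = NormalDist().inv_cdf(0.975)  # two-sided 95% critical value
margin = z * stdev / len(sample) ** 0.5
ci_low, ci_high = mean - margin, mean + margin

print(f"mean={mean:.2f}, median={median}, stdev={stdev:.2f}")
print(f"approximate 95% CI for the population mean: ({ci_low:.2f}, {ci_high:.2f})")
```

The first half only describes the ten observations; the second half makes a claim about the population they came from, which is why it carries a margin of error.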

- Sample Statistics
- Estimations of Population Parameters
- Random and Non-random Sampling
- Sampling Distributions
- Degrees of Freedom
- The **Central Limit Theorem**

- Mean
- Median
- Mode

- Range
- IQR
- Variance
- Standard Deviation

- Events, Sample Space and Probabilities
- Conditional Probabilities
- Independence of Events
- **Bayes' Theorem**
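
As a small worked example of conditional probability, the sketch below applies Bayes' theorem to a screening test. The function name `bayes_posterior` and the prevalence, sensitivity, and false-positive figures are hypothetical values chosen for illustration.

```python
def bayes_posterior(prior, likelihood, false_positive_rate):
    """Return P(A | B) given P(A), P(B | A) and P(B | not A)."""
    # Law of total probability: P(B) = P(B|A)P(A) + P(B|not A)P(not A)
    evidence = likelihood * prior + false_positive_rate * (1 - prior)
    return likelihood * prior / evidence


# Hypothetical screening test: 1% prevalence, 95% sensitivity, 5% false positives.
posterior = bayes_posterior(prior=0.01, likelihood=0.95, false_positive_rate=0.05)
print(f"P(condition | positive test) = {posterior:.3f}")
```

With these assumed numbers the posterior is roughly 16%, far below the 95% sensitivity, because the condition is rare and most positive results come from the much larger unaffected group.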

- Random Variable
- The Normal Distributions
- Confidence Intervals
- Hypothesis Testing

- Null Hypothesis
- The Significance Level
- p-value
- Type I and Type II Errors

- t-test (Student's t-test)
- F-test
- Z-test
- Chi-square test

- ANOVA Computations
- Two-way ANOVA

- Data Summaries
- Covariance, Correlation, and Distances
- Missing Values Handling
- Outliers Handling
- Principal Component Analysis
- Exploratory Factor Analysis

- Ordinary Least Squares
- Ridge Regression
- Lasso Regression
- K Nearest Neighbours Regression & Classification

- Training Set
- Validation Set
- Test Set
- Cross-Validation
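
To make the idea of cross-validation concrete, here is a minimal pure-Python sketch of how k-fold splitting partitions sample indices. The helper `k_fold_indices` is a hypothetical name written for this illustration; it does not shuffle, and it assumes a separate test set has already been held out.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for each of k folds."""
    indices = list(range(n_samples))
    # Distribute the remainder so fold sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size


folds = list(k_fold_indices(n_samples=10, k=3))
for train, validation in folds:
    print("train:", train, "validation:", validation)
```

Each sample lands in the validation set exactly once, so every observation contributes to both fitting and evaluation. In practice the indices would usually be shuffled first, and a library routine would normally be used instead of a hand-written helper.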

- **Logistic Regression**
- Linear Discriminant Analysis
- Quadratic Discriminant Analysis

- Bagging (Parallel Ensemble) - Random Forest
- Boosting (Sequential Ensemble) - Gradient Boosting

- Structure of Neural Network
- Hidden Layers and Neurons
- Weights and Transfer Function

- Trend and Seasonal Analysis
- Different Smoothing Techniques
- ARIMA Modelling
- ETS Modelling

- Hierarchical (Agglomerative) Clustering
- Non-Hierarchical Clustering: The k-Means Algorithm

- Apriori Algorithm
- Frequent Item-sets
- Support
- Confidence
- Lift Ratio
- Discovering Association Rules
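
The three rule metrics listed above can be computed directly from a set of transactions. The sketch below uses a small hypothetical basket dataset and hand-written helper functions; it illustrates the definitions only and is not an implementation of the Apriori search itself.

```python
# Hypothetical market-basket transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk", "eggs"},
]


def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent):
    """Estimated P(consequent | antecedent)."""
    return support(set(antecedent) | set(consequent)) / support(antecedent)


def lift(antecedent, consequent):
    """Confidence relative to the consequent's baseline support."""
    return confidence(antecedent, consequent) / support(consequent)


rule = ({"bread"}, {"milk"})
print("support:", support(rule[0] | rule[1]))
print("confidence:", confidence(*rule))
print("lift:", lift(*rule))
```

A lift below 1, as the rule bread → milk produces on this toy data, suggests the two items appear together slightly less often than independence would predict.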

- Sentiment Analysis
- User Behaviour Analysis
- Topic Categorization
- Topic Ranking

- Collaborative Filtering Recommenders
- Content Based Recommenders

- Software Installation on Various Operating Systems
- Introduction to Real Time Applications
- Introduction to Popular Packages

- Basic Data Types
- R Data Structures
- Vectors
- Matrix
- List
- Data Frames
- R Functions
- Predictive Modelling Project based on R
- Classification Modelling Project based on R
- Clustering Project based on R
- Association Mining Project based on R
- R Visualization Packages
- Machine Learning Packages in R

- Installing Python on Windows
- Installing Python on Mac and Linux
- Introduction to Editors
- Installing PyCharm and Sublime Editors

- Numbers and Math in Python
- Variables and Inputs
- Built-in Modules and Functions
- Save and Run Python Files
- Strings
- Python List
- Python slices and slicing
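
As a quick taste of the list and slicing topics, the sketch below shows the `start:stop:step` slice forms on a hypothetical list of numbers.

```python
numbers = [10, 20, 30, 40, 50, 60]

first_three = numbers[:3]      # items at index 0, 1, 2
last_two = numbers[-2:]        # negative indices count from the end
every_other = numbers[::2]     # step of 2
reversed_copy = numbers[::-1]  # a negative step walks backwards

print(first_three, last_two, every_other, reversed_copy)
```

Each slice returns a new list, so `numbers` itself is left unchanged.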

- Scikit-Learn
- NumPy
- SciPy
- Pandas
- Matplotlib

- Introduction to Data Science and Visualization Tools in Python
- Installing and Setting up IPython Notebook
- Installing Anaconda and Pandas
- Setting Up Environment

- Creating Arrays
- Using Arrays and Scalars
- Indexing Arrays
- Array Transposition
- Universal Array Functions
- Array Processing
- Array Input and Output

- Series
- Data Frames
- Index Objects
- Reindex
- Drop Entry
- Selecting Entries
- Data Alignment
- Rank and Sort
- Summary Statistics
- Missing Data
- Index Hierarchy

- Reading and Writing Text Files
- JSON with Python
- HTML with Python
- Microsoft Excel Files with Python

- Merge, Merge on Index and Concatenate
- Combining Data Frames
- Reshaping and Pivoting
- Duplicates in Data Frames
- Mapping, Replacing, Rename Index and Binning
- Outliers and Permutations

- Group by on Data Frames
- Group by on Dict and Series
- Aggregation
- Splitting, Applying and Combining
- Cross Tabulation

- Installing Seaborn
- Histograms
- Kernel Density Estimate Plots
- Combining Plot Styles
- Box and Violin Plots
- Regression Plots
- Heat Maps and Clustered Matrices
- Example Projects-15

- Introduction
- Linear Regression
- Logistic Regression
- Multi Class Classification - Logistic Regression
- Multi Class Classification - Nearest Neighbor
- Support Vector Machines
- Naïve Bayes Theory

- Introduction
- Analytics through designed experiments
- Analytics through Active learning
- Analytics through Reinforcement learning

- A couple of real-time analytics projects based on R scripts and Python scientific libraries.

- RDD Concept
- Spark MLlib: Data Types, Algorithms, and Utilities