Statistical Regression and Classification

Statistical Regression and Classification
Author: Norman Matloff
Publisher: CRC Press
Total Pages: 439
Release: 2017-09-19
Genre: Business & Economics
ISBN: 1351645897

Statistical Regression and Classification: From Linear Models to Machine Learning takes an innovative look at the traditional statistical regression course, presenting a contemporary treatment in line with today's applications and users. The text takes a modern look at regression: * A thorough treatment of classical linear and generalized linear models, supplemented with introductory material on machine learning methods. * Since classification is the focus of many contemporary applications, the book covers this topic in detail, especially the multiclass case. * In view of the voluminous nature of many modern datasets, there is a chapter on Big Data. * Has special Mathematical and Computational Complements sections at ends of chapters, and exercises are partitioned into Data, Math and Complements problems. * Instructors can tailor coverage for specific audiences such as majors in Statistics, Computer Science, or Economics. * More than 75 examples using real data. The book treats classical regression methods in an innovative, contemporary manner. Though some statistical learning methods are introduced, the primary methodology used is linear and generalized linear parametric models, covering both the Description and Prediction goals of regression methods. The author is just as interested in Description applications of regression, such as measuring the gender wage gap in Silicon Valley, as in forecasting tomorrow's demand for bike rentals. An entire chapter is devoted to measuring such effects, including discussion of Simpson's Paradox, multiple inference, and causation issues. Similarly, there is an entire chapter of parametric model fit, making use of both residual analysis and assessment via nonparametric analysis. Norman Matloff is a professor of computer science at the University of California, Davis, and was a founder of the Statistics Department at that institution. His current research focus is on recommender systems, and applications of regression methods to small area estimation and bias reduction in observational studies. He is on the editorial boards of the Journal of Statistical Computation and the R Journal. An award-winning teacher, he is the author of The Art of R Programming and Parallel Computation in Data Science: With Examples in R, C++ and CUDA.

Modern Multivariate Statistical Techniques

Modern Multivariate Statistical Techniques
Author: Alan J. Izenman
Publisher: Springer Science & Business Media
Total Pages: 757
Release: 2009-03-02
Genre: Mathematics
ISBN: 0387781897

This is the first book on multivariate analysis to look at large data sets which describes the state of the art in analyzing such data. Material such as database management systems is included that has never appeared in statistics books before.

Predictive Analytics

Predictive Analytics
Author: Ajit C. Tamhane
Publisher: John Wiley & Sons
Total Pages: 384
Release: 2020-10-13
Genre: Mathematics
ISBN: 1118948890

Provides a foundation in classical parametric methods of regression and classification essential for pursuing advanced topics in predictive analytics and statistical learning This book covers a broad range of topics in parametric regression and classification including multiple regression, logistic regression (binary and multinomial), discriminant analysis, Bayesian classification, generalized linear models and Cox regression for survival data. The book also gives brief introductions to some modern computer-intensive methods such as classification and regression trees (CART), neural networks and support vector machines. The book is organized so that it can be used by both advanced undergraduate or masters students with applied interests and by doctoral students who also want to learn the underlying theory. This is done by devoting the main body of the text of each chapter with basic statistical methodology illustrated by real data examples. Derivations, proofs and extensions are relegated to the Technical Notes section of each chapter, Exercises are also divided into theoretical and applied. Answers to selected exercises are provided. A solution manual is available to instructors who adopt the text. Data sets of moderate to large sizes are used in examples and exercises. They come from a variety of disciplines including business (finance, marketing and sales), economics, education, engineering and sciences (biological, health, physical and social). All data sets are available at the book’s web site. Open source software R is used for all data analyses. R codes and outputs are provided for most examples. R codes are also available at the book’s web site. Predictive Analytics: Parametric Models for Regression and Classification Using R is ideal for a one-semester upper-level undergraduate and/or beginning level graduate course in regression for students in business, economics, finance, marketing, engineering, and computer science. It is also an excellent resource for practitioners in these fields.

An Introduction to Statistical Learning

An Introduction to Statistical Learning
Author: Gareth James
Publisher: Springer Nature
Total Pages: 617
Release: 2023-08-01
Genre: Mathematics
ISBN: 3031387473

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance, marketing, and astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, deep learning, survival analysis, multiple testing, and more. Color graphics and real-world examples are used to illustrate the methods presented. This book is targeted at statisticians and non-statisticians alike, who wish to use cutting-edge statistical learning techniques to analyze their data. Four of the authors co-wrote An Introduction to Statistical Learning, With Applications in R (ISLR), which has become a mainstay of undergraduate and graduate classrooms worldwide, as well as an important reference book for data scientists. One of the keys to its success was that each chapter contains a tutorial on implementing the analyses and methods presented in the R scientific computing environment. However, in recent years Python has become a popular language for data science, and there has been increasing demand for a Python-based alternative to ISLR. Hence, this book (ISLP) covers the same materials as ISLR but with labs implemented in Python. These labs will be useful both for Python novices, as well as experienced users.

Classification and Regression Trees

Classification and Regression Trees
Author: Leo Breiman
Publisher: Routledge
Total Pages: 370
Release: 2017-10-19
Genre: Mathematics
ISBN: 135146048X

The methodology used to construct tree structured rules is the focus of this monograph. Unlike many other statistical procedures, which moved from pencil and paper to calculators, this text's use of trees was unthinkable before computers. Both the practical and theoretical sides have been developed in the authors' study of tree methods. Classification and Regression Trees reflects these two sides, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.

Interpretable Machine Learning

Interpretable Machine Learning
Author: Christoph Molnar
Publisher: Lulu.com
Total Pages: 320
Release: 2020
Genre: Computers
ISBN: 0244768528

This book is about making machine learning models and their decisions interpretable. After exploring the concepts of interpretability, you will learn about simple, interpretable models such as decision trees, decision rules and linear regression. Later chapters focus on general model-agnostic methods for interpreting black box models like feature importance and accumulated local effects and explaining individual predictions with Shapley values and LIME. All interpretation methods are explained in depth and discussed critically. How do they work under the hood? What are their strengths and weaknesses? How can their outputs be interpreted? This book will enable you to select and correctly apply the interpretation method that is most suitable for your machine learning project.

Swarm Intelligence Methods for Statistical Regression

Swarm Intelligence Methods for Statistical Regression
Author: Soumya Mohanty
Publisher: CRC Press
Total Pages: 77
Release: 2018-12-14
Genre: Business & Economics
ISBN: 1351365029

A core task in statistical analysis, especially in the era of Big Data, is the fitting of flexible, high-dimensional, and non-linear models to noisy data in order to capture meaningful patterns. This can often result in challenging non-linear and non-convex global optimization problems. The large data volume that must be handled in Big Data applications further increases the difficulty of these problems. Swarm Intelligence Methods for Statistical Regression describes methods from the field of computational swarm intelligence (SI), and how they can be used to overcome the optimization bottleneck encountered in statistical analysis. Features Provides a short, self-contained overview of statistical data analysis and key results in stochastic optimization theory Focuses on methodology and results rather than formal proofs Reviews SI methods with a deeper focus on Particle Swarm Optimization (PSO) Uses concrete and realistic data analysis examples to guide the reader Includes practical tips and tricks for tuning PSO to extract good performance in real world data analysis challenges

Bayesian Methods for Nonlinear Classification and Regression

Bayesian Methods for Nonlinear Classification and Regression
Author: David G. T. Denison
Publisher: John Wiley & Sons
Total Pages: 302
Release: 2002-05-06
Genre: Mathematics
ISBN: 9780471490364

Bei der Regressionsanalyse von Datenmaterial erhält man leider selten lineare oder andere einfache Zusammenhänge (parametrische Modelle). Dieses Buch hilft Ihnen, auch komplexere, nichtparametrische Modelle zu verstehen und zu beherrschen. Stärken und Schwächen jedes einzelnen Modells werden durch die Anwendung auf Standarddatensätze demonstriert. Verbreitete nichtparametrische Modelle werden mit Hilfe von Bayes-Verfahren in einen kohärenten wahrscheinlichkeitstheoretischen Zusammenhang gebracht.

The Statistical Evaluation of Medical Tests for Classification and Prediction

The Statistical Evaluation of Medical Tests for Classification and Prediction
Author: Margaret Sullivan Pepe
Publisher: OUP Oxford
Total Pages: 319
Release: 2003-03-13
Genre: Medical
ISBN: 019158861X

This book describes statistical techniques for the design and evaluation of research studies on medical diagnostic tests, screening tests, biomarkers and new technologies for classification and prediction in medicine.