Environmental statistics is a rapidly growing field, supported by advances in digital computing power, automated data collection systems, and interactive, linkable Internet software. Concerns over public and ecological health and the continuing need to support environmental policy-making and regulation have driven a concurrent explosion in environmental data analysis. This textbook is designed to address the need for trained professionals in this area. The book is based on a course which the authors have taught for many years, and prepares students for careers in environmental analysis centered on statistics and allied quantitative methods of data evaluation. The text extends beyond the introductory level, allowing students and environmental science practitioners to develop the expertise to design and perform sophisticated environmental data analyses. In particular, it: Provides a coherent introduction to intermediate and advanced methods for modeling and analyzing environmental data. Takes a data-oriented approach to describing the various methods. Illustrates the methods with real-world examples Features extensive exercises, enabling use as a course text. Includes examples of SAS computer code for implementation of the statistical methods. Connects to a Web site featuring solutions to exercises, extra computer code, and additional material. Serves as an overview of methods for analyzing environmental data, enabling use as a reference text for environmental science professionals. Graduate students of statistics studying environmental data analysis will find this invaluable as will practicing data analysts and environmental scientists including specialists in atmospheric science, biology and biomedicine, chemistry, ecology, environmental health, geography, and geology.
Solutions Manual to accompany Statistical Data Analytics: Foundations for Data Mining, Informatics, and Knowledge Discovery A comprehensive introduction to statistical methods for data mining and knowledge discovery. Extensive solutions using actual data (with sample R programming code) are provided, illustrating diverse informatic sources in genomics, biomedicine, ecological remote sensing, astronomy, socioeconomics, marketing, advertising and finance, among many others.
A comprehensive introduction to statistical methods for data mining and knowledge discovery. Applications of data mining and ‘big data’ increasingly take center stage in our modern, knowledge-driven society, supported by advances in computing power, automated data acquisition, social media development and interactive, linkable internet software. This book presents a coherent, technical introduction to modern statistical learning and analytics, starting from the core foundations of statistics and probability. It includes an overview of probability and statistical distributions, basics of data manipulation and visualization, and the central components of standard statistical inferences. The majority of the text extends beyond these introductory topics, however, to supervised learning in linear regression, generalized linear models, and classification analytics. Finally, unsupervised learning via dimension reduction, cluster analysis, and market basket analysis are introduced. Extensive examples using actual data (with sample R programming code) are provided, illustrating diverse informatic sources in genomics, biomedicine, ecological remote sensing, astronomy, socioeconomics, marketing, advertising and finance, among many others. Statistical Data Analytics: Focuses on methods critically used in data mining and statistical informatics. Coherently describes the methods at an introductory level, with extensions to selected intermediate and advanced techniques. Provides informative, technical details for the highlighted methods. Employs the open-source R language as the computational vehicle – along with its burgeoning collection of online packages – to illustrate many of the analyses contained in the book. Concludes each chapter with a range of interesting and challenging homework exercises using actual data from a variety of informatic application areas. This book will appeal as a classroom or training text to intermediate and advanced undergraduates, and to beginning graduate students, with sufficient background in calculus and matrix algebra. It will also serve as a source-book on the foundations of statistical informatics and data analytics to practitioners who regularly apply statistical learning to their modern data.