Download Free Mineria De Texto Con R Book in PDF and EPUB Free Download. You can read online Mineria De Texto Con R and write the review.

Chapter 7. Case Study : Comparing Twitter Archives; Getting the Data and Distribution of Tweets; Word Frequencies; Comparing Word Usage; Changes in Word Use; Favorites and Retweets; Summary; Chapter 8. Case Study : Mining NASA Metadata; How Data Is Organized at NASA; Wrangling and Tidying the Data; Some Initial Simple Exploration; Word Co-ocurrences and Correlations; Networks of Description and Title Words; Networks of Keywords; Calculating tf-idf for the Description Fields; What Is tf-idf for the Description Field Words?; Connecting Description Fields to Keywords; Topic Modeling.
e-Research y español LE/L2: Investigar en la era digital es el primer volumen que aborda de manera conjunta las aportaciones al español LE/L2 de la lingüística de corpus, la biblioteconomía y la edición digital. Es excelente para mejorar las técnicas de investigación a la vez que se toma conciencia sobre el uso de las tecnologías en los estudios sobre el español LE/L2. Características principales: visión interdisciplinar e internacional a partir del trabajo de expertos que ejercen su actividad docente, investigadora y profesional en diferentes ámbitos y en distintos países; planteamiento teórico-práctico mediante la exposición de una reflexión teórica y la descripción de casos prácticos; sólido marco teórico que se presenta en los dos primeros capítulos; estructura homogénea dividida en útiles apartados (necesidades, cómo ayudan las tecnologías y casos concretos) para que el lector pueda localizar los contenidos con facilidad; lectura del volumen que puede ser lineal (capítulo tras capítulo) o transversal (por ejemplo, los casos prácticos que se presentan en cada capítulo); materiales complementarios en línea, como, por ejemplo, glosario hipertextual y enlaces a los corpus y programas mencionados en los capítulos. Escrito en español, de manera clara y accesible, y con abundantes ejemplos e ilustraciones, e-Research y español LE/L2: Investigar en la era digital es ideal para todas aquellas personas vinculadas con la investigación en torno al español LE/L2: estudiantes de máster y doctorado, directores de tesis (PhD o máster) y profesores. e-Research y español LE/L2: Investigar en la era digital is the first volume that jointly addresses the contributions of corpus linguistics, librarianship and digital publishing to Spanish as a second or foreign language (LE/L2). It is excellent for improving research techniques while raising awareness about the use of technologies in studies of Spanish LE/L2. Main features: interdisciplinary and international vision based on the work of experts who carry out their teaching, research and professional activities in different fields and in different countries; theoretical-practical approach through the presentation of a theoretical reflection and the description of practical cases; solid theoretical framework which is presented in the first two chapters; each chapter is divided into three useful sections (needs, how technologies help, and specific cases) so that the reader can easily locate the contents; reading can be linear (chapter by chapter) or transversal (for example, the practical cases presented in each chapter); supplementary online materials include a hypertext glossary and links to the corpus and programs mentioned in the chapters. Written in Spanish, in a clear and accessible way, and with abundant examples and illustrations, e-Research y español LE/L2: Investigar en la era digital is ideal for all those involved in research on Spanish LE/L2, master's and doctoral students, thesis supervisors and professors.
Digital preservation is an issue faced by practitioners in Ross Harveythe library and recordkeeping professions, yet most professionalshave little time to keep up with the latest techniquesand standards. This invaluable work provides a single-volume introduction to the principles, strategies and practices currently applied by librarians and recordkeepers to the preservation of digital information and will assist them to make informed decisions about the role of digital information in their care. The book is presented in four parts: Why do we preserve? What do we preserve? How do we preserve? and How do we manage digital preservation? Each part covers the area in detail and addresses current issues in a clear and informative manner. The terminology of the field is explained clearly throughout the book. Each chapter includes a range of case studies from institutionsat the forefront of digital object preservation. An index facilitates quick access. This book will be essential as a professional reference tool for all librarians, recordkeepers and archivists with preservation responsibilities as well as being a definitive source of information for the whole profession including students.
This book constitutes the proceedings of the XVI Multidisciplinary International Congress on Science and Technology (CIT 2021), held in Quito, Ecuador, on June 14–18, 2021, proudly organized by Universidad de las Fuerzas Armadas ESPE in collaboration with GDEON. CIT is an international event with a multidisciplinary approach that promotes the dissemination of advances in science and technology research through the presentation of keynote conferences. In CIT, theoretical, technical, or application works that are research products are presented to discuss and debate ideas, experiences, and challenges. Presenting high-quality, peer-reviewed papers, the book discusses the following topics: Artificial Intelligence Computational Modeling Data Communications Defense Engineering Innovation, Technology, and Society Managing Technology & Sustained Innovation, and Business Development Security and Cryptography Software Engineering
The Enciclopedia de Linguistica Hispánica provides comprehensive coverage of the major and subsidiary fields of Spanish linguistics. Entries are extensively cross-referenced and arranged alphabetically within three main sections: Part 1 covers linguistic disciplines, approaches and methodologies. Part 2 brings together the grammar of Spanish, including subsections on phonology, morphology, syntax and semantics. Part 3 brings together the historical, social and geographical factors in the evolution of Spanish. Drawing on the expertise of a wide range of contributors from across the Spanish-speaking world the Enciclopedia de Linguistica Hispánica is an indispensable reference for undergraduate and postgraduate students of Spanish, and for anyone with an academic or professional interest in the Spanish language/Spanish linguistics.
A hands on guide to web scraping and text mining for both beginners and experienced users of R Introduces fundamental concepts of the main architecture of the web and databases and covers HTTP, HTML, XML, JSON, SQL. Provides basic techniques to query web documents and data sets (XPath and regular expressions). An extensive set of exercises are presented to guide the reader through each technique. Explores both supervised and unsupervised techniques as well as advanced techniques such as data scraping and text management. Case studies are featured throughout along with examples for each technique presented. R code and solutions to exercises featured in the book are provided on a supporting website.
This book examines common tasks performed by business analysts and helps the reader navigate the wealth of information in R and its 4000 packages to create useful analytics applications. Includes interviews with corporate users of R, and easy-to-use examples.
This three volume set (CCIS 853-855) constitutes the proceedings of the 17th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, IPMU 2017, held in Cádiz, Spain, in June 2018. The 193 revised full papers were carefully reviewed and selected from 383 submissions. The papers are organized in topical sections on advances on explainable artificial intelligence; aggregation operators, fuzzy metrics and applications; belief function theory and its applications; current techniques to model, process and describe time series; discrete models and computational intelligence; formal concept analysis and uncertainty; fuzzy implication functions; fuzzy logic and artificial intelligence problems; fuzzy mathematical analysis and applications; fuzzy methods in data mining and knowledge discovery; fuzzy transforms: theory and applications to data analysis and image processing; imprecise probabilities: foundations and applications; mathematical fuzzy logic, mathematical morphology; measures of comparison and entropies for fuzzy sets and their extensions; new trends in data aggregation; pre-aggregation functions and generalized forms of monotonicity; rough and fuzzy similarity modelling tools; soft computing for decision making in uncertainty; soft computing in information retrieval and sentiment analysis; tri-partitions and uncertainty; decision making modeling and applications; logical methods in mining knowledge from big data; metaheuristics and machine learning; optimization models for modern analytics; uncertainty in medicine; uncertainty in Video/Image Processing (UVIP).
This book constitutes the refereed proceedings of the 10th Iberoamerican Congress on Pattern Recognition, CIARP 2005, held in Havana, Cuba in November 2005. The 107 revised full papers presented together with 3 keynote articles were carefully reviewed and selected from more than 200 submissions. The papers cover ongoing research and mathematical methods for pattern recognition, image analysis, and applications in such diverse areas as computer vision, robotics, industry, health, entertainment, space exploration, telecommunications, data mining, document analysis, and natural language processing and recognition.