Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Airbnb (Air Bed and Breakfast) Listing Analysis Through Machine Learning Techniques

Xiang Li, Jingxi Liao, Tianchuan Gao

Source Title: Biomedical and Business Applications Using Artificial Neural Networks and Machine Learning

DOI: 10.4018/978-1-7998-8455-2.ch008

Chapter PDF Download Open access chapters are freely available for download

Abstract

Machine learning is a broad field that contains multiple fields of discipline including mathematics, computer science, and data science. Some of the concepts, like deep neural networks, can be complicated and difficult to explain in several words. This chapter focuses on essential methods like classification from supervised learning, clustering, and dimensionality reduction that can be easily interpreted and explained in an acceptable way for beginners. In this chapter, data for Airbnb (Air Bed and Breakfast) listings in London are used as the source data to study the effect of each machine learning technique. By using the K-means clustering, principal component analysis (PCA), random forest, and other methods to help build classification models from the features, it is able to predict the classification results and provide some performance measurements to test the model.

Chapter Preview

Top

Introduction

Nowadays, machine learning (ML) is well-known and can be used in solving different types of problems such as probability, convex analysis and approximation theory. It is a type of artificial intelligence (AI) and it mainly focuses on letting the computer learn by itself without the control from humans (Expert.ai Team, 2020). It may look difficult to some beginners, but the method we mentioned here is about classification from supervised learning, clustering, and dimensionality reduction which is easy to explain and understand. Moreover, we want to show not only the effect of machine learning but also how close this technique can be applied to our daily life, so we use the dataset from Airbnb listings to do the analysis.

Airbnb which stands for Air Bed and Breakfast, a famous online marketplace for lodging, is often used by a large number of travelers and landlords. It provides a platform between tenants and renters and helps them match each other easily and conveniently. It was built in 2008 and started in San Francisco, California USA before spreading to all over the world (Bivens, 2019). Based on some statistics, the Airbnb covers 220 countries and regions with active listings, has nearly 500 million guests since its creation and was joined by 14,000 new hosts in each month of 2021 (Deane, 2021). In order to keep our data source comprehensive and multifarious, we select the Airbnb listing from London as a dataset which contains 76,619 numbers of listings information and over 8 features. Then, we use K-means clustering, hierarchical clustering, Principal Component Analysis, random forest to analyze the date we choose and we will use the decision tree to predict the data after the analysis process.

Firstly, We are going to introduce K-means clustering. K-means clustering is one of the unsupervised learning which is easy to explain. Cluster is a common type of data analysis and it is used to separate the original data to different subgroups or clusters, so the data with the same group will be very similar (Dabbura, 2018). Furthermore, K-mean, which is a kind of algorithm of the centroid-based and distance-based, is used to assign different data points to different clusters through the calculation of the distance from the point to the cluster centroid which is randomly selected in the beginning (Sharma, 2019). After that, we will reselect new cluster centroids and redo the assigned process again and again, so our goal in the K-mean cluster is to repeat the select and assign process and find suitable clusters with minimal distance from each data point to the cluster centroid (Sharma, 2019).

The second method we use is hierarchical clustering which is similar to the first method that is a type of unsupervised learning and is to cluster data points but with different standards. In hierarchical clustering, we initially consider each of the data points as different clusters and then find the closest two clusters and merge them together (Patlolla, 2018). Hierarchical clustering is similar to the K-mean cluster in that those processes will run cyclically but it is different that all the data points will be in a single cluster. Compare K-mean clustering with hierarchical clustering, we have the assumption that if the dataset has a large number of variables, it is better to use K-mean clustering and if we want the result explicable and structured, hierarchical clustering is more suitable (Das, 2020).

Moreover, we also mention Principal Component Analysis (PCA) during analyzing the dataset. Principal Component Analysis which is also called PCA is a method to reduce or refine the dimension of a dataset and the smaller dataset which we transferred from the original dataset still contains the important information (Jaadi, 2021). Therefore, our goal in the PCA is to make the dataset concise and effective.

Key Terms in this Chapter

Supervised Learning: A method in machine learning uses the model that has been trained to analyze the data.

Principal Component Analysis (PCA): A method used in data analysis is to refine the size of data and make the dataset effectively.

Unsupervised Learning: A technique in machine learning that allows users to run the model without supervision.

K-Means Clustering: A kind of algorithm that separates different data points to different clusters based on different values.

Hierarchical Clustering: A method separates different data points to different clusters based on hierarchy and merge different clusters to one.

Decision Tree: A method that build a tree-like models to present all the possible consequences in different kinds of data is used in the process of data prediction.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Airbnb (Air Bed and Breakfast) Listing Analysis Through Machine Learning Techniques

Abstract

Introduction

Key Terms in this Chapter

Complete Chapter List