Office-Home Dataset

Visualization of the Office-Home dataset in the Deep Lake UI

What is Office-Home Dataset?

The Office-Home dataset was created to evaluate deep learning algorithms for object recognition under domain adaptation. It consists of images from 4 different domains: Art, Clipart, Product, and Real-World. The images cover 65 categories of objects commonly found in office and home settings.

Download Office-Home Dataset in Python

Instead of downloading the Office-Home dataset manually, you can load it in Python with a single line of code using the open-source Deep Lake package.

Load Office-Home Dataset in Python

import deeplake

# Open the public Office-Home dataset hosted by Activeloop
ds = deeplake.load('hub://activeloop/office-home-domain-adaptation')

Office-Home Dataset Structure

Office-Home Data Fields
  • images: tensor containing images
  • domain_objects: labels that represent 65 categories of objects in each domain
  • domain_categories: labels that represent 4 domain categories
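Domain adaptation experiments usually train on one domain and evaluate on another, so the domain_categories labels are typically used to split samples into a source set and a target set. The following is a minimal sketch of that split in plain Python. The label order in DOMAIN_NAMES and the toy label list are assumptions made for illustration; the actual mapping from integer labels to domain names should be read from the dataset itself.

```python
from collections import defaultdict

# Assumed label order, for illustration only. The real mapping from
# integer label to domain name should be taken from the dataset metadata.
DOMAIN_NAMES = ["Art", "Clipart", "Product", "Real-World"]


def group_indices_by_domain(domain_labels):
    """Map each domain name to the list of sample indices carrying that label."""
    groups = defaultdict(list)
    for index, label in enumerate(domain_labels):
        groups[DOMAIN_NAMES[int(label)]].append(index)
    return dict(groups)


def source_target_split(domain_labels, source, target):
    """Return (source_indices, target_indices) for one adaptation task."""
    groups = group_indices_by_domain(domain_labels)
    return groups.get(source, []), groups.get(target, [])


# Toy labels standing in for the per-sample domain_categories values.
toy_labels = [0, 0, 1, 2, 3, 1, 3]
source_idx, target_idx = source_target_split(toy_labels, "Art", "Real-World")
print(source_idx, target_idx)  # [0, 1] [4, 6]
```

The resulting index lists can then be used to select the samples that form the training (source) and evaluation (target) sets.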

How to use Office-Home Dataset with PyTorch and TensorFlow in Python

Train a model on Office-Home Dataset with PyTorch in Python

Use Deep Lake's built-in one-line PyTorch dataloader to connect the data to your compute:

dataloader = ds.pytorch(num_workers=0, batch_size=4, shuffle=False)
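Batching with batch_size greater than 1 generally requires every image in a batch to have the same shape, which crawled images often do not. The sketch below shows one way to bring images to a fixed size and a channels-first float layout using only NumPy. The function names, the 224x224 target size, and the nearest-neighbour resizing are illustrative choices, not part of Deep Lake, and the code assumes 3-channel images stored as height x width x channel uint8 arrays.

```python
import numpy as np


def resize_nearest(image, size=(224, 224)):
    """Resize an HxWxC array to size (rows, cols) by nearest-neighbour sampling."""
    height, width = image.shape[:2]
    # For each output row/column, pick the corresponding input row/column.
    rows = np.arange(size[0]) * height // size[0]
    cols = np.arange(size[1]) * width // size[1]
    return image[rows][:, cols]


def to_model_input(image, size=(224, 224)):
    """Convert an HxWx3 uint8 image to a 3xHxW float32 array scaled to [0, 1]."""
    resized = resize_nearest(image, size)
    channels_first = np.transpose(resized, (2, 0, 1))
    return channels_first.astype(np.float32) / 255.0


# Synthetic white image standing in for one sample from the images tensor.
example = np.full((300, 200, 3), 255, dtype=np.uint8)
print(to_model_input(example).shape)  # (3, 224, 224)
```

Assuming the Deep Lake 3.x dataloader accepts a per-tensor transform argument, a function like to_model_input could be supplied for the images tensor so that samples are resized before they are collated into a batch. In practice an interpolating resize from an image library would usually give better quality than nearest-neighbour sampling.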

Train a model on Office-Home Dataset with TensorFlow in Python

dataloader = ds.tensorflow()

Office-Home Dataset Creation

Data Collection and Normalization Information
A Python crawler was used to collect the images. The initial crawl gathered about 100,000 images of 120 different objects. The dataset was then cleaned to make sure that each image actually contains the intended object, and categories were kept only if they had a sufficient number of images. The final version of the dataset has about 15,500 images of 65 different objects.

