Machine Learning Datasets Machine Learning Datasets
  • GitHub
  • Slack
  • Documentation
Get Started
Machine Learning Datasets Machine Learning Datasets
Get Started
Machine Learning Datasets
  • GitHub
  • Slack
  • Documentation

Docy

Machine Learning Datasets

  • Folder icon closed Folder open iconDatasets
    • MNIST
    • ImageNet Dataset
    • COCO Dataset
    • CIFAR 10 Dataset
    • CIFAR 100 Dataset
    • FFHQ Dataset
    • Places205 Dataset
    • GTZAN Genre Dataset
    • GTZAN Music Speech Dataset
    • The Street View House Numbers (SVHN) Dataset
    • Caltech 101 Dataset
    • LibriSpeech Dataset
    • dSprites Dataset
    • PUCPR Dataset
    • RAVDESS Dataset
    • GTSRB Dataset
    • CSSD Dataset
    • ATIS Dataset
    • Free Spoken Digit Dataset (FSDD)
    • not-MNIST Dataset
    • ECSSD Dataset
    • COCO-Text Dataset
    • CoQA Dataset
    • FGNET Dataset
    • ESC-50 Dataset
    • GlaS Dataset
    • UTZappos50k Dataset
    • Pascal VOC 2012 Dataset
    • Pascal VOC 2007 Dataset
    • Omniglot Dataset
    • HMDB51 Dataset
    • Chest X-Ray Image Dataset
    • NIH Chest X-ray Dataset
    • Fashionpedia Dataset
    • DRIVE Dataset
    • Kaggle Cats & Dogs Dataset
    • Lincolnbeet Dataset
    • Sentiment-140 Dataset
    • MURA Dataset
    • LIAR Dataset
    • Stanford Cars Dataset
    • SWAG Dataset
    • HASYv2 Dataset
    • WFLW Dataset
    • Visdrone Dataset
    • 11k Hands Dataset
    • QuAC Dataset
    • LFW Deep Funneled Dataset
    • LFW Funneled Dataset
    • Office-Home Dataset
    • LFW Dataset
    • PlantVillage Dataset
    • Optical Handwritten Digits Dataset
    • UCI Seeds Dataset
    • STN-PLAD Dataset
    • FER2013 Dataset
    • Adience Dataset
    • PPM-100 Dataset
    • CelebA Dataset
    • Fashion MNIST Dataset
    • Google Objectron Dataset
    • CARPK Dataset
    • CACD Dataset
    • Flickr30k Dataset
    • Kuzushiji-Kanji (KKanji) dataset
    • KMNIST
    • EMNIST Dataset
    • USPS Dataset
    • MARS Dataset
    • HICO Classification Dataset
    • NSynth Dataset
    • RESIDE dataset
    • Electricity Dataset
    • DRD Dataset
    • Caltech 256 Dataset
    • AFW Dataset
    • PACS Dataset
    • TIMIT Dataset
    • KTH Actions Dataset
    • WIDER Face Dataset
    • WISDOM Dataset
    • DAISEE Dataset
    • WIDER Dataset
    • LSP Dataset
    • UCF Sports Action Dataset
    • Wiki Art Dataset
    • FIGRIM Dataset
    • ANIMAL (ANIMAL10N) Dataset
    • OPA Dataset
    • DomainNet Dataset
    • HAM10000 Dataset
    • Tiny ImageNet Dataset
    • Speech Commands Dataset
    • 300w Dataset
    • Food 101 Dataset
    • VCTK Dataset
    • LOL Dataset
    • AQUA Dataset
    • LFPW Dataset
    • ARID Video Action dataset
    • NABirds Dataset
    • SQuAD Dataset
    • ICDAR 2013 Dataset
    • Animal Pose Dataset
  • Folder icon closed Folder open iconDeep Lake Docs Home
  • Folder icon closed Folder open iconDataset Visualization
  • API Basics
  • Storage & Credentials
  • Getting Started
  • Tutorials (w Colab)
  • Playbooks
  • Data Layout
  • Folder icon closed Folder open iconShuffling in ds.pytorch()
  • Folder icon closed Folder open iconStorage Synchronization
  • Folder icon closed Folder open iconTensor Relationships
  • Folder icon closed Folder open iconQuickstart
  • Folder icon closed Folder open iconHow to Contribute

Office-Home Dataset

Estimated reading: 4 minutes

Visualization of the office-home dataset in the Deep Lake UI

Office-Home Dataset

What is Office-Home Dataset?

The Office-Home dataset was created to assess deep learning algorithms for domain adaptation-based object recognition. The dataset consists of images from 4 different domains which include art, clip art, product, and Real-World images. The dataset contains images of 65 types of objects commonly found in Office-Home Settings.

Download Office-Home Dataset in Python

Instead of downloading the Office-Home Dataset in Python, you can effortlessly load it in Python via our Deep Lake open-source with just one line of code.

Load Office-Home Dataset in Python

				
					import deeplake
ds = deeplake.load('hub://activeloop/office-home-domain-adaptation')
				
			

Office-Home Dataset Structure

Office-Home Data Fields
  • images: tensor containing images
  • domain_objects: labels that represent 65 categories of objects in each domain
  • domain_categories: labels that represent 4 domain categories

How to use Office-Home Dataset with PyTorch and TensorFlow in Python

Train a model on Office-Home Dataset with PyTorch in Python

Let’s use Deep Lake built-in PyTorch one-line dataloader to connect the data to the compute:

				
					dataloader = ds.pytorch(num_workers = 0, batch_size= 4, shuffle = False)
				
			
Train a model on Office-Home Dataset with TensorFlow in Python
				
					dataloader = ds.tensorflow()
				
			

Office-Home Dataset Creation

Data Collection and Normalization Information
Python crawler was used for image collection. There were 100,000 images of 120 different objects. To make sure that the right objects are present in the image, the dataset was cleaned. It was also ensured that each category has a certain number of images. The last version of the dataset has 15,500 images of 65 different objects.

Additional Information about Office-Home Dataset

Office-Home Dataset Description

  • Homepage: https://www.hemanthdv.org/officeHomeDataset.html
  • Repository: N/A
  • Paper: https://openaccess.thecvf.com/content_cvpr_2017/papers/Venkateswara_Deep_Hashing_
    Network_CVPR_2017_paper.pdf
  • Point of Contact: N/A
Office-Home Dataset Curators
Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty and Sethuraman Panchanathan
 
Office-Home Dataset Licensing Information
More information about the license can be found here. Deep Lake users may have access to a variety of publicly available datasets. We do not host or distribute these datasets, vouch for their quality or fairness, or claim that you have a license to use the datasets. It is your responsibility to determine whether you have permission to use the datasets under their license. If you’re a dataset owner and do not want your dataset to be included in this library, please get in touch through a GitHub issue. Thank you for your contribution to the ML community!
Office-Home Dataset Citation Information
				
					@inproceedings{venkateswara2017deep, 
title={Deep hashing network for unsupervised domain adaptation},
author={Venkateswara, Hemanth and Eusebio, Jose and Chakraborty, Shayok and Panchanathan, Sethuraman}, 
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition}, 
pages={5018--5027}, 
year={2017} }
				
			

Office-Home Dataset FAQs

What is the Office-Home dataset for Python?

The Office-Home dataset was developed to assess domain adaptation algorithms for object recognition using deep learning. The dataset is made up of images from four different domains—artistic, product, real-world images, and clip art. A Python web crawler that crawled through several search engines and online image directories was used to collect the images in the dataset.

What is the Office-Home dataset used for?

The Office-Home dataset is used as a benchmark dataset for domain adaptation. It contains four domains and each domain consists of 65 categories. The four domains include art (a collection of artistic images in the form of sketches), clipart (a collection of clipart images), product (a domain containing images of objects without a background), and real-world images (a domain containing images of objects captured with a regular camera).

How to download the Office-Home dataset in Python?

With the open-source package Activeloop Deep Lake in Python, you can load the Office-Home dataset fast with one line of code. See detailed instructions on how to load the Office-Home dataset in Python.

How can I use Office-Home dataset in PyTorch or TensorFlow?

Using the open-source package Activeloop Deep Lake in Python you can stream the Office-Home dataset while training a model in PyTorch or TensorFlow with one line of code. See detailed instructions on how to train a model on the Office-Home dataset with PyTorch in Python or train a model on the Office-Home dataset with TensorFlow in Python.

Datasets - Previous LFW Funneled Dataset Next - Datasets LFW Dataset
Leaf Illustration

© 2022 All Rights Reserved by Snark AI, inc dba Activeloop