Art Datase / Museum

Artwork : 130,000+
Format: CSV
License : CC0-1.0 license

National Gallery of Art Open Data Program

The dataset provides data records relating to the 130,000+ artworks in our collection and the artists who created them. You can download the dataset free of charge without seeking authorization from the National Gallery of Art.
The dataset is published in CSV format and uses UTF-8 encoding, and is updated daily. Links and references to images and other media such as audio and video files are contained in the dataset, but the images and media files themselves are not included under this program.

WebsiteGithub

Records : 15,679+
Format: CSV,JSON
License : CC0-1.0 license

The Museum of Modern Art (MoMA) Collection

The Artists dataset contains 15,679 records, representing all the artists who have work in MoMA’s collection and have been cataloged in our database. It includes basic metadata for each artist, including name, nationality, gender, birth year, death year, Wiki QID, and Getty ULAN ID.
At this time, both datasets are available in CSV format, encoded in UTF-8. While UTF-8 is the standard for multilingual character encodings, it is not correctly interpreted by Excel on a Mac. Users of Excel on a Mac can convert the UTF-8 to UTF-16 so the file can be imported correctly. The datasets are also available in JSON.

WebsiteAPI

Records : 470,000+
Format: CSV
License : CC0-1.0 license

The Metropolitan Museum of Art Open Access CSV

The Metropolitan Museum of Art provides select datasets of information on more than 470,000 artworks in its Collection for unrestricted commercial and noncommercial use.
At this time, the datasets are available in CSV format, encoded in UTF-8. While UTF-8 is the standard for multilingual character encodings, it is not correctly interpreted by Excel on a Mac. Users of Excel on a Mac can convert the UTF-8 to UTF-16 so the file can be imported correctly.

WebsiteGithub

Artworks : 70,000+
Artists : 3,500+
Format: CSV,JSON
License : CC0-1.0 license

The Tate Collection

The dataset in this repository was last updated in October 2014. Tate has no plans to resume updating this repository, but we are keeping it available for the time being in case this snapshot of the Tate collection is a useful tool for researchers and developers.
Here we present the metadata for around 70,000 artworks that Tate owns or jointly owns with the National Galleries of Scotland as part of ARTIST ROOMS. Metadata for around 3,500 associated artists is also included.

WebsiteGithub

Size : 1.75 GB
Access : API, Data Dumps
Format: JSON
License : CC0-1.0 license

The Art Institute of Chicago

Founded in 1879, the Art Institute of Chicago is one of the world’s major museums, housing an extraordinary collection of objects from across places, cultures, and time. We are also a place of active learning for all—dedicated to investigation, innovation, education, and dialogue—continually aspiring to greater public service and civic engagement.
We provide API and data dumps. These data dumps are updated nightly. They are generated from our API. As such, they contain the same data as our API, and their schema mirrors that of the API. The data is dumped in JSON format, with one JSON file per record. Records are grouped by API resource type.

WebsiteData DumpsAPI

Artworks : 64,000+
Access : API, Data Dumps
Format: CSV,JSON
License : CC0-1.0 license

The Cleveland Museum of Art Open Access

The Cleveland Museum of Art (CMA) was founded in 1913 “for the benefit of all the people forever.” The museum strives to help the broadest possible audience understand and engage with the world’s great art. The Cleveland Museum of Art is one of the most comprehensive art museums in the world and one of northeastern Ohio’s principal civic and cultural institutions.
The Cleveland Museum of Art provides datasets of information on more than 64,000 artwork records in its Collection for unrestricted commercial and noncommercial use. Additionally, the museum provides image assets for over 37,000 works, which are made available under the same terms. Links to the web, print, and full-sized, uncompressed versions of these images are included in the dataset where applicable.

WebsiteData DumpsAPI

Artworks : 245,688+
Access : API
Format: JSON

Harvard Art Museums

The Harvard Art Museums API is a REST-style service designed for developers who wish to explore and integrate the museums’ collections in their projects. The API provides direct access to JSON formatted data that powers this website and many other aspects of the museums.
And every request must be accompanied by the apikey parameter and an API key. The API uses keys to authenticate requests. API keys take the form 00000000-0000-0000-0000-000000000000.

WebsiteAPI

Movie Poster / Music Cover

MovieNet

1.1K Movies, 60K trailers, 375K meta etc., all freely avaliable at MovieNet.

Poster: 4M+
Size: 1.3GB

Github

One Million Audio Cover Images

This collection includes over one million JPG, PNG and GIF album covers.

Cover: 1 M+
Size: 161.4 GB

Website

MusicBrainz

MusicBrainz is an open music encyclopedia that collects music metadata and makes it available to the public.

Format: JSON,XML
License : CC0-1.0 license

WebsiteCover API

Movie Genre from its Poster

The collected dataset contains IMDB Id, IMDB Link, Title, IMDB Score, Genre and link to download movie posters.

Poster: 39,371
Size: 26.78 MB

Kaggle

Movie-Poster Dataset

We collected 1,500 movie posters featuring various artistic-style titles to address the current market’s lack of artistic-style text data

Poster: 1500
Size: 1.0 GB

GithubGoogle Drive