Instagram web scraper (without auth API)
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
KaKi87 7d799fd9c7 Warn client when session ID changed 3 months ago
.gitignore Upgrade to 1.0.1 4 months ago
LICENSE Initial commit 4 months ago
README.md Update README.md 3 months ago
index.js Warn client when session ID changed 3 months ago
package.json Upgrade to 1.0.9, added `business` property to profile, more changes upcoming 3 months ago
yarn.lock Upgrade to 1.0.3 4 months ago

README.md

ig-scraper

Scrape data from Instagram without applying for the authenticated API.

Getting started

Prerequisites

  • NodeJS
  • NPM
  • Yarn

Install

From npm

yarn add @kaki87/ig-scraper

or

npm i @kaki87/ig-scraper

Use

const Insta = require('@kaki87/ig-scraper');
const InstaClient = new Insta();

Authentication

Authentication allows you to access private profile as long as you follow them.

Importing your session ID
  • Go to instagram.com
  • Login (if not already logged in)
  • Open development tools (Ctrl + Shift + I)
  • Get the sessionid cookie value
    • For chromium-based browsers : application tab
    • For firefox-based browsers : storage tab
Code
InstaClient.authBySessionID(yourSessionID)
	.then(account => console.log(account))
	.catch(err => console.error(err));

If authentication is successfull, you’ll get the form data from accounts/edit :

{
	"first_name": "",
	"last_name": "",
	"email": "",
	"is_email_confirmed": true,
	"is_phone_confirmed": true,
	"username": "",
	"phone_number": "",
	"gender": 1,
	"birthday": null,
	"biography": "",
	"external_url": "",
	"chaining_enabled": true,
	"presence_disabled": false,
	"business_account": false,
	"usertag_review_enabled": false
}

If your session ID is invalid, you’ll get the 401 error.

Username/password authentication may be supported in the future.

Get

These methods allows you to get specific elements from Instagram while you know exactly what you’re looking for.

Errors handling

get may return errors in the two following cases.

  • Request error : failed to get data from Instagram (HTTP code)
  • Parsing error : failed to parse data returned by Instagram (406)
  • No content : nothing to parse (204)
  • Authentication required : session ID required to access data (401)
Get profile by username
InstaClient.getProfile(username)
	.then(profile => console.log(profile))
	.catch(err => console.error(err));

Result

  • name string - public full name
  • pic url - public profile picture
  • bio string - public biography
    website url - public website
    more info about bio & website
  • private boolean - account private state
  • access boolean - access to the profile’s feed
    In order to have access to a private account’s feed, you must have sent him a follow request that he accepted.
  • verified boolean - account verified state
  • followers integer - number of users following this profile
  • following integer - number of users this profile follows
  • posts integer - number of posts this profile published
  • lastPosts array of posts - last posts
    This property is empty ([]) when the profile doesn’t have any post but null if access is false (denied).
  • link url - link to the profile’s page
  • business string - business category (where applicable and profile unblocked)
  • user object - user relevant properties (while authenticated) :
    • mutualFollowers array of usernames - people following you and this profile
    • blocking boolean - you blocked this profile
    • blocked boolean - this profile blocked you (only available property in user while true)
    • requesting boolean - you sent a follow request to this profile (if private)
    • requested boolean - this profile sent you a follow request (if yours is private)
    • following boolean - you’re following this profile
    • followed boolean - this profile follows you

TBA : story

Get hashtag
InstaClient.getHashtag(hashtag)
	.then(hashtag => console.log(hashtag))
	.catch(err => console.error(err));

Result

  • pic url - hashtag profile pic (can’t find out how it is chosen)
  • posts integer - number of posts containing this hashtag
  • featuredPosts array of posts - featured posts published with this hashtag
    lastPosts array of posts - last posts published with this hashtag
    more info about hashtag posts
  • link url - link to the hashtag’s page
  • user object - user relevant properties (while authenticated) :
    • following boolean - you subscribed to this hashtag (receiving posts in your personal feed)

TBA : stories

Get location by ID

Unfortunately, using IDs is currently the only way to get a location, at least for now.

InstaClient.getLocation(id)
	.then(location => console.log(location))
	.catch(err => console.error(err));

Result

  • pic url - location profile pic
  • posts integer - posts published from that location
  • address object
    • street string
    • zipCode string
    • city string
    • latitude float
    • longitude float
  • website url - place’s website
  • phone string - place’s contact phone number
  • featuredPosts array of posts - featured posts published from this location
    lastPosts array of posts - last posts published from this location
  • link url - link to this location’s page
Array of posts

This is a subset of a real post, containing the following properties :

  • shortcode string - post identifier
  • caption string - post description
  • comments integer - number of comments
  • likes integer - number of likes
  • thumbnail url - post thumbnail
    Always static image wether it’s a photo or a video post, lower quality.
Get post by shortcode

The shortcode is the post’s identifier : the link to a post is instagram.com/p/shortcode.

InstaClient.getPost(shortcode)
	.then(post => console.log(post))
	.catch(err => console.error(err));

Result

  • author object - a subset of a profile’s properties.
    • username string
    • name string
    • pic url
    • verified boolean
    • link url
  • location
    • name string
    • city string
  • contents array of posts
    • type string - post type : photo or video
    • url string - link to original post file (jpg, mp4, …)
    • if type is video :
      thumbnail string - link to thumbnail
      views integer - number of views
  • tagged array of usernames - people tagged in post contents
  • likes integer - number of likes
  • caption string - post description
  • hashtags array of hashtags - hashtags mentioned in post description
  • mentions array of usernames - people mentioned in post description
  • comments array of objects (Max 40)
    • user string - comment author’s username
    • content string - comment content
    • timestamp epoch
    • hashtags array of hashtags
    • mentions array of usernames
    • likes integer
  • timestamp epoch
  • link string - link to the post

Search

Search profile
InstaClient.searchProfile(query)
	.then(profiles => console.log(profiles))
	.catch(err => console.error(err));

Result in array : a subset of profile.

  • username
  • name
  • pic
  • private
  • verified
  • followers
  • user
    • following
Search hashtag
InstaClient.searchHashtag(hashtag)
	.then(hashtags => console.log(hashtags))
	.catch(err => console.error(err));

Result in array : a subset of hashtag.

  • name
  • posts
Search location
InstaClient.searchLocation(location)
	.then(locations => console.log(locations))
	.catch(err => console.error(err));

Result in array : a subset of location.

  • id
  • name
  • address
    • street
    • city
    • latitude
    • longitude

Subscribe

  • interval integer (optional) - time in seconds between requests. Default : 30
  • lastPost string (optional) - shortcode from which to begin if not the next one to be published.
Subscribe to user posts
InstaClient.subscribeUserPosts(username, interval, lastPost).subscribe({
	next: shortcode => console.log(shortcode),
	error: err => console.error(err)
});
Subscribe to hashtag posts
InstaClient.subscribeHashtagPosts(hashtag, interval, lastPost).subscribe({
	next: shortcode => console.log(shortcode),
	error: err => console.error(err)
});

Authenticated requests

Get account notifications
InstaClient.getAccountNotifications()
	.then(notifications => console.log(notifications))
	.catch(err => console.error(err));

Result in array : notification

  • id string - Notification identifier
  • timestamp epoch
  • type string - Notification type : like, mention, comment, follow
  • post
    • shortcode
    • thumbnail
  • by
    • username
    • name
    • pic
  • content string - Comment content (where applicable)
Subscribe to account notifications
  • lastNotification string (optional) - Notification ID
InstaClient.subscribeAccountNotifications(interval, lastNotification).subscribe({
	next: notification => console.log(notification),
	error: err => console.error(err)
});

Changelog

  • 1.0.0 (2019-03-26) • Initial release
  • 1.0.1 (2019-03-27)
    • Fixed throw error scope
    • Fixed single photo post wrongly structured
    • Added support for comments
    • Added support for hashtags, mentions and tags in posts and comments
    • Added posts subscriptions feature from users (untested) and hashtags
  • 1.0.2 (2019-03-27) • Added support for videos
  • 1.0.4 (2019-03-27)
    • Fixed video post thumbnail & views count
    • Using promises & observable
  • 1.0.5 (2019-03-27) • Added proper error for private accounts
  • 1.0.6 (2019-03-31) • Private account access doesn’t require mutual follow
  • 1.0.7 (2019-04-03) • Added profile’s last posts analytics #1 by juliomontilla100 + more
  • 1.0.8 (2019-04-14)
    • Using classes
    • Added support for authentication using session cookie (only allows to access friend profile)
    • Added support for locations
    • Added search feature for profiles, hashtags & locations
    • Added user-relevant properties
    • Added support for notifications history & subscription
    • Fixed subscriptions since #1
    • Removed useless id properties
  • 1.0.9 (2019-04-??)
    • Added business property to profile (where applicable)
    • Automatically access public profile anonymously when user blocked