4.7 KiB
Instagram -> Pixelfed Mirror in Python (3+)
This Python scripts with a simple HTTP API to use it, can mirror multiple accounts from Instagram to your Pixelfed instance.
Disclaimer
Mirroring Instagram accounts will copy (clone) the images and videos from Instagram to your server, thus moving the data sovereignty to you as an Admin.
Always make sure you mirror accounts whos content is not strictly copyrighted (or branded) and keep the tagging system of the source code that marks accounts on Pixelfed as Mirrors.
Users of your Pixelfed instance have to know this accounts are not real accounts as someone on Instagram could create a real Pixelfed account in the future.
Pre-requisites
- You need an Instagram account to make this bot work.
- A Pixelfed instance you own. It must be hosted or managed by you (you need shell access for php artisan commands)
- Python 3+ installed on the same machine as the Pixelfed instance (this is easy, mostly all standard Debian based Linux systems have python3)
- A bit more disk space for Pixelfed media storage. (It depends on how many accounts you mirror and how often that accounts post on IG)
- Patch 1 file on Pixelfed deployed code (being on docker or not)
What can you mirror?
- Posts with images (multiple images support).
- Posts with videos (only 1 video).
- Each posts captions (post description) is represented as splitted comments.
What can't you mirror (yet) ?
- Stories. (will support that soon)
Installation
Patch some files
As i said on the Pre-requisites, you will need to patch the following files on Pixelfed running code:
This patch is needed to disable Pixelfed API rate limiting completely as it might pop-up when using your mirror bot.
We asume your Pixelfed installation is at /var/www
(if it isn't, just change the steps to match your installation)
/var/www/vendor/laravel/framework/src/Illuminate/Routing/Middleware/ThrottleRequests.php
Add return 99999999;
after the line where it checks if the user is authenticated.
/**
* Resolve the number of attempts if the user is authenticated or not.
*
* @param \Illuminate\Http\Request $request
* @param int|string $maxAttempts
* @return int
*/
protected function resolveMaxAttempts($request, $maxAttempts)
{
if (Str::contains($maxAttempts, '|')) {
$maxAttempts = explode('|', $maxAttempts, 2)[$request->user() ? 1 : 0];
}
if (! is_numeric($maxAttempts) && $request->user()) {
return 99999999;
$maxAttempts = $request->user()->{$maxAttempts};
}
return (int) $maxAttempts;
}
Setup the environment
- Clone the project somewhere
git clone https://git.nogafam.es/nogafam/ig-pixelfed-mirror.git
- Create needed directories for the environment
cd ig-pixelfed-mirror/
mkdir cache db headers
# DIRECTORIES EXPLAINED
# ----
# "cache" will keep the cached html/json/jpg/mp4 files from Instagram (it can be safely cleaned, it doesn't need backup)
# "db" will keep files containing information about your mirrored accounts
# (such as: login credentials, cookies for Pixelfed, IG posts that was already added (to avoid duplicates), etc...)
# "headers" is a directory you need to fill with TXT plain text files containing some key headers from Instagram Web sessions
# (I explain this on step 5!)
- Copy
config.json.example
toconfig.json
and configure to your needs. Make sure you remove the comments from the file because Python doesn't like them. - Copy
scripts/user_create.example
toscripts/user_create
. This script runs thephp artisan user:create
command with the given positional parameters to automatically create local Pixelfed accounts with forced email verification. Adapt to your needs. - Log in to your Instagram account on a Browser (with remember ticked ON), open your browser's developer console at "Network" tab, navigate throught IG a bit and copy the Request Headers of any request done to instagram.com. Make sure there is
X-IG-App-ID, X-IG-WWW-Claim, Cookie
headers set.
# create a new headers file on "headers/" on your working directory
# you can view an example at "headers.example" file
vim headers/1.txt
# FILE EDITING
# ----
# 1. Add the headers you copied from the browser.
# 2. Keep only the "X-IG-App-ID, X-IG-WWW-Claim, Cookie" headers,
# "User-Agent" might be also good to keep (to match the browser)
# (It uses a common User-Agent by default)
# -
# Done editing
Create as much headers TXT files as you wish. It will be used randomly on requests to Instagram by the mirror bot.