ffscraper¶

Yet another Python package for scraping FanFiction.Net…

pip install ffscraper

Authors: Alexander L. Hayes (@batflyer)

Version: 0.2.0

Documentation: Module Index

Search: Search Page

Source: GitHub

Bugtracker: GitHub Issues

FanFiction.Net was established in 1998 and is among the world’s largest collection of user-submitted fanfiction (works of fanfiction authored by fans of existing stories; such as movies, books, or TV shows). Recently the large amount of easily-available user content has drawn interest in analyzing the content and creative differences between original works and their fanfiction counterparts [1], and [2] created an anonymized dataset of the metadata.

This project is twofold: creating open-source systems for scraping content, and using that content to build open-source systems which can be used by the FanFiction.Net community.

Installation and Usage¶

Interact with the scraper from the commandline:

$ pip install ffscraper
$ python -m ffscraper --help
$ python -m ffscraper -s 123

Or import the Python package and start building your own systems:

from __future__ import print_function
import ffscraper as ffs

sids = ['123', '124', '125']

for id in sids:
    story = ffs.fanfic.story.scraper(id)
    print(story)

Contents:

ffscraper package

[1]	Milli, Smitha and David Bamman, “Beyond Canonical Texts: A Computational Analysis of Fanfiction.” Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing.

[2]	Yin, K., Aragon, C., Evans, S. and Katie Davis. “Where No One Has Gone Before: A Meta-Dataset of the World’s Largest Fanfiction Repository.” Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems. ACM, 2017.

ffscraper¶

Installation and Usage¶

ffscraper

Navigation

Related Topics

Authors:	Alexander L. Hayes (@batflyer)
Version:	0.2.0
Documentation:	Module Index
Search:	Search Page
Source:	GitHub
Bugtracker:	GitHub Issues