• Home
  • Academic
  • Publications
  • Resources
  • Contact
  • Home
  • Academic
  • Publications
  • Resources
  • Contact

Code Notebooks

Github Link

2/2/2018

 
New code will be posted over at Github!
​Earlier code is still available on Github in the form of gists.

Analyzing the Properties of Episodic Word Lists

1/23/2017

 
The following code can be used to:
​1) Generate a list of words within a given frequency class (or a mix of frequency classes) from the ​SUBTLEXus database of word frequencies.
2) Automatically analyze the properties of a given list, including:
  • word frequency
  • word length
  • letter n-gram frequency
3) Compare a set of n lists on these same properties, as well as:
  • orthographic similarity (Levenshtein edit distance)
  • semantic similarity (word2vec)

Quick Data Processing with Pandas

12/9/2015

 
This demo script shows how tools from the Pandas library can be used to quickly and efficiently process experimental data generated with PsychoPy.

Writing a Twitter Bot

3/22/2015

 
​The following code illustrates how to write a Twitter bot that automatically finds relevant users to follow, and unfollows users that haven't followed back. In the example usage, the bot intakes a seed list of influencers and harvests information about their followers. It then selectively follows a subset of the most relevant users, given a set of customizable conditions (e.g., language, keywords, Tweet frequency). In keeping with API limits, the bot auto-follows 250 new users at a time, caching a "blacklist" of users that have already been added.

Estimating Sequence Probabilities with First-Order Markov Models

6/5/2014

 

Extracting Frequency Information from Google Search

4/22/2013

 

Author Classification using NLTK

12/12/2012

 

    Code Snippets

    Introductory coding demos in Python. Click the file name in the lower left-hand corner to view full screen at Github.

    Archives

    February 2018
    January 2017
    December 2015
    March 2015
    June 2014
    April 2013
    December 2012

    Categories

    All
    Data Analysis
    Episodic Memory
    Language Processing
    N-gram
    NLP
    NLTK
    Social Media
    Stylometrics
    Web Scraping

Powered by Create your own unique website with customizable templates.