Commonplace books, or commonplaces, were a way to compile knowledge, usually by writing information into books. Such books were essentially scrapbooks filled with items of every kind. This commonplace includes datasets, research resources, stats, econometric methods, libraries in R and python, and the like. Because I work in tech policy, there is a strong leaning towards tech, innovation and telecom in the legal and legislative research section.
Public policy cheatsheet
It is still in beta.
Economic lectures + notes
- Price Theory in Economics
- Intermediate Micro Review
- Notes on Macroeconomic Theory by Steve Williamson
- Discrete-Choice Models of Demand
- Decision Theory: A Brief Introduction [source]
- Regional Impact Models by William A. Schaffer; A critique of input-output models
- Market Efficiency and Market Failures by Jon Steinsson; The concept of efficiency in economics
- Profit maximization
- Intermediate Macroeconomics: Consumption Eric Sims
- Options markets
- Important Papers for Quantitative Traders
- Public finance notes
- Scott H. Irwin’s Personal Collection of Hidden Academic Treasures. “My collection of academic papers that are not well known today but I think are significant and worth preserving.” Mostly on the topic of agricultural economics.
Reading lists
- “Reading List in Macroeconomics and Monetary Economics” (2014)
- “New Institutional Economics: Reading List, Introductory”
- “Democracy in a Digital Age” Papacharissi
- “Seminar in Media Studies” Papacharissi
Math + stats
If you have technical questions, check out Luke Stein’s Stanford graduate economics core. 65 pages of stats review. For a gentler introduction to the ideas of probability, check out Jane Street’s Introduction to Probability and Markets. “Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations.” (link)
- Power Laws in Economics: An Introduction
- Notes for Nonparametric Statistics
- A list of great math videos online.
- NIST/SEMATECH e-Handbook of Statistical Methods
- Rob Hyndman’s Forecasting Principles and Practice (FPP3) book is a great resource that covers both the math and applications to real data.
- Machine Learning Mastery
- Bayesian statistics with R
- Statistical Rethinking: A Bayesian Course (with Code Examples in R/Stan/Python/Julia)
- YouTube series: Statistical Rethinking Winter 2019
- An explainer on the problem of bias.
- The notes and problems from Yales Math Camp.
- Some Linear Algebra for Econometrics [pdf]
- Essentials of Stochastic Processes by Rick Durrett
- YouTube series: Statistical Rethinking Winter 2019
- An Introduction to Topos Theory
Econometrics + model making
- “This document contains the set of lecture notes from the late Gary Chamberlain’s 2010 Econometrics class (EC2120) that I (Paul Goldsmith-Pinkham) took during my economics Ph.D. at Harvard University. Gary was a remarkable teacher and this class was an amazing experience for me as a young economist.” [Github]
- Cara Jackson is collecting Wooldridge’s Twitter lessons in a Google Doc.
- Lecture Notes on Identification Strategies by Štěpán Jurajda [source]
- Quick review of OLS assumptions [itfeature]
- Quick-R: Regression Diagnostics
- Quick-R: Multiple Regression
- When Should I Use Confidence Intervals, Prediction Intervals, and Tolerance Intervals [Minitab]
- Logistic & Tobit Regression [pdf]
- Affine term structure models [pdf]
- When to include interaction terms in your regression.
- Frank Pinter has notes on
- From Christine Cai
- Metrics discussions: diff-in-diff and event studies (by Chloe East)
- DiD Reading Group (Summer 2021) (by Taylor Wright)
- Current literature on diff-in-diff (by Asjad Naqvi)
- Course videos and slides: Applied Empirical Methods (by Paul Goldsmith-Pinkham)
- Course notes from classes at UC Berkeley (by Kristy Kim)
- Graduate Public Finance lecture materials (by Owen Zidar)
- Graduate Labor Econ course materials (by David Autor)
- Graduate (Macro) Labor Econ course materials (by Florian Oswald)
- Gary Chamberlain’s lecture notes (by Paul Goldsmith-Pinkham)
- Graduate Development & Econometrics lecture notes (by Simon Quinn)
- The Stata workflow guide (by Asjad Naqvi)
- Coding and do-file structure (by Michael Stepner)
- Stata coding guide (by Julian Reif)
- Stata cheat sheets (by Tim Essam & Laura Hughes)
- Customizing Stata graphs (by Ben Jann)
- Maps in Stata (by Asjad Naqvi)
- Advanced Mapping with Stata: OpenStreetMap (OSM) and QGIS (by Asjad Naqvi)
- Stata guide on Mata (by Asjad Naqvi)
- Stata guide on MLE (by Asjad Naqvi)
- Microeconometrics Using Stata (by Colin Cameron & Pravin Trivedi)
- Tips for managing large-scale datasets efficiently in Stata
- Introduction to R (by Hans H. Sievertsen)
- Applied Economics with R (by Hans H. Sievertsen)
- Regression analysis in R (by Grant R. McDermott)
- List of open source books about R (by Pere A. Taberner)
- Big Book of R
- LaTeX Table Hints and Tips
- Statistical models across R/Python/Stata (by the Library of Statistical Techniques)
- Stata-to-Python equivalents (by Daniel M. Sullivan)
- StataTex Blog: Tips for Stata, Latex and other useful resources for applied economists (by Jan Sauermann)
- Stata and GitHub Integration (by Asjad Naqvi)
- Conducting power calculations in Stata/R (by Sabhya Gupta)
Programming for economists, R, python, libraries, packages, etc.
- Stock-flow Consistent Macroeconomic Models: A Survey | Levy Economics Institute
- The FRBNY DSGE Model Meets Julia Liberty Street Economics
- PyIO: Python Module for Input-Output Analysis
- QuantEcon
- The Causal Analytics Toolkit (CAT) provides powerful, easy-to-use software to help overcome these limitations. It makes state-of-the-art causal analytics available to anyone who has Microsoft Excel.
- How to run specification curve analysis in R.
- “Start with R for Data Science. Efficient R Programming also helps a lot. If you’re doing computational stuff specifically, though, don’t use R. Here’s how with Python or Julia. Instead of Anaconda, I’d recommend Poetry. No, pip alone won’t cut it. This is a very good comment but I’d like to add that poetry doesn’t let you use different python versions like Anaconda does and you’ll have to manage it with yet another tool like pyenv.”
- The estimatr package (https://cran.r-project.org/packages/estimatr) will do 80%+ of the regressions you need for day-to-day micro work. In ~1 line.
- lfe package (https://CRAN.R-project.org/package=lfe) is also super. Four dimensions of fixed effects, multi-way clustered SEs, no problemo.
- Need marginal effects estimates? @thosjleeper got ya covered with the margins package (https://cran.r-project.org/package=margins)
- All sorts of machine learning tools. New stuff all the time. https://cran.r-project.org/web/views/MachineLearning.html
- Read about other tools here: https://cran.r-project.org/web/views/Econometrics.html
- Another example: Competing ML packages all have different syntax, different option sets, different output structure. @topepos’s parsnip package https://tidymodels.github.io/parsnip/) and the tidymodels infrastructure are working on improving things though.
- The MatchIt and WeightIt (https://cran.r-project.org/web/packages/WeightIt…) packages aim to do similar things for propensity score analysis.
- pyEdgeworthBox provides with a tool to plot the Edgeworth box and calculate equilibrium, core, pareto effective allocation etc in the pure exchange economy. [Github]
- cspp is a package designed to allow a user with only basic knowledge of R to find variables on state politics and policy, create and export datasets from these variables, subset the datasets by states and years, create map visualizations, and export citations to common file formats (e.g., .bib). [Github]
- NeuralNetworkLawfirmNames – Using torch, generated list of law firms, trained using Law360’s Top 400 law firms by size [Github]
- Python word2vec [link]
- GeoDa [link]
- Using PGFPLOTS to make economic graphs in LATEX [link]
- Bayesian histograms for rare event classification
- Stock-flow Consistent Macroeconomic Models: A Survey | Levy Economics Institute
- The FRBNY DSGE Model Meets Julia Liberty Street Economics
- PyIO: Python Module for Input-Output Analysis
- QuantEcon
- The Causal Analytics Toolkit (CAT) provides powerful, easy-to-use software to help overcome these limitations. It makes state-of-the-art causal analytics available to anyone who has Microsoft Excel.
- How to run specification curve analysis in R.
Federal Reserve data
Federal Reserve Economic Data is a great resource for all government datasets. Some key FRED series:
- TIPS 5-Year Breakeven Inflation Rate
- TIPS 10-Year Breakeven Inflation Rate
- Real Personal Consumption Expenditures: Goods
- Real Personal Consumption Expenditures: Services
- Total Business Inventories
- Real Private Inventories
- Key interest rate metrics at FRED
- Unemployment Level/Job Openings: Total Nonfarm
- Employed full time: Median usual weekly real earnings: Wage and salary workers: 16 years and over – WR: This is a good measure of real wages.
- Nonfarm Business Sector: Real Hourly Compensation for All Employed Persons – WR: This another good measure measure of real wages.
- National income: Corporate profits before tax (without IVA and CCAdj)/Gross Domestic Product
- Real Median Family Income in the United States
- U-6 Unemployment Measure
- Producer Price Index by Commodity: Internet Advertising Sales, Excluding Internet Advertising Sold by Print Publishers
- Producer Price Index: Advertising Time Series
- Total Factor Productivity at Constant National Prices for United States
- Average Hourly Earnings of All Employees, Total Private
- Employed full time: Median usual weekly real earnings: Wage and salary workers: 16 years and over
- Consumer Price Index for All Urban Consumers: Purchasing Power of the Consumer Dollar in U.S. City Average
- Smoothed Recession Probabilities
The Atlanta Fed’s Wage Growth Tracker
Census data
The FactFinder is dead. Explore Census data instead. Other Census data:
- Small Area Income and Poverty Estimates (SAIPE)
- Business Dynamics Statistics – BDS provides annual measures of business dynamics (such as job creation and destruction, establishment births and deaths, and firm startups and shutdowns) for the economy and aggregated by establishment and firm characteristics.
- Nonemployer statistics – NES is an annual series that provides subnational economic data for businesses that have no paid employees and are subject to federal income tax. This series includes the number of businesses and total receipts by industry.
- Economic Census – The economic census serves as the foundation for the measurement of U.S. businesses, including the Island Areas, and their economic impact.
- A complete list of the Census' economic surveys – The U.S. Census Bureau business surveys and censuses measure the pulse of the U.S. economy, businesses, and governments. They provide data for businesses in the economic sectors such as manufacturing, construction, retail trade, health care, and services industries, as well as for state and local governments, and on imports and exports.
- Computer and Internet Use – In recent decades, computer usage and Internet access has become increasingly important for gathering information, looking for jobs, and participation in a changing world economy. See also: The NTIA’s Digital Nation Data Explorer which includes raw data for the Internet and Computer Use Survey
- Code Lists, Definitions, and Accuracy – View the detailed codes and definitions for variables, statistical testing, and an explanation of sample design, methodology, and accuracy for the American Community Survey.
- Surveys & Programs – The U.S. Census Bureau conducts more than 130 surveys and programs each year. This is a list of all of them.
- Are Millennials making less than their parents? When you check recent Census data, you get a completely different interpretation. Table P-10 located here lays out median/mean income for those 25 to 34 going back to 1974, adjusted to constant 2018 dollars. It starts at line 104. Quite clearly those aged 25-34 in 2018 are doing better than any group in the 1980s since the median income of \$37,133 is higher than the 1980s peak of \$33,356.
- The National Survey of Children’s Health (NSCH) examines the physical and emotional health of children ages 0-17 years of age. Special emphasis is placed on factors related to the well-being of children. These factors include access to - and quality of - health care, family interactions, parental health, neighborhood characteristics, as well as school and after-school experiences.
The Bureau of Labor Statistics (BLS) data
The Bureau of Labor Statistics
- Job Openings and Labor Turnover Survey (JOLTS)
- Local area unemployment
- BEA : Regional Economic Accounts: Download – GDP per state data
- Archived News Releases : U.S. Bureau of Labor Statistics – Self explanatory.
- Calendar year aggregate expenditure shares across selected groups by demographic characteristics
Bureau of Economic Analysis (BEA) data
Bureau of Economic Analysis
- Bureau of Economic Analysis Input-Output tables
- Interactive Data Tables
- State GDP Estimates Under Quarterly Gross Domestic Product by State, you can chart out the “Contributions to percent change in real GDP (SQGDP11)” which should give you the basic breakdown by industry. BEA’s tools aren’t the best, so you might need to export it.
Other government data
- Data.gov – The home of the U.S. Government’s open data.
- Rural Urban Continuum Codes – The Rural-Urban Continuum Codes form a classification scheme that distinguishes metropolitan counties by the population size of their metro area, and nonmetropolitan counties by degree of urbanization and adjacency to a metro area.
- The Rural Atlas – If you need to do analysis of counties, the Rural Atlas is your go to. It is the most up-to-date document pulling together population and economic data from the 5 year ACS.
- O*NET OnLine – ONET contains hundreds of standardized and occupation-specific descriptors on almost 1,000 occupations covering the entire U.S. economy. The database, which is available to the public at no cost, is continually updated from input by a broad range of workers in each occupation.
- USA Spending – USAspending.gov is the official source for spending data for the U.S. Government.
- About Underlying Cause of Death, 1999-2019
- A list of construction indices
- Interagency Data Inventory – The Interagency Data Inventory is a product of the Data Committee of the Financial Stability Oversight Council (FSOC). The inventory catalogs the data collected by FSOC member organizations. The inventory contains information — metadata — about each data collection. It does not contain the underlying datasets. For each data collection, the inventory has basic information, such as a brief description of the collection, collecting organization, and the name and number of the form used to collect the data.
- The Private School Survey produces data similar to that of the NCES Common Core of Data (CCD) for the public schools. The data are useful for a variety of policy- and research-relevant issues, such as the growth of religiously-affiliated schools, the length of the school year, the number of private high school graduates, and the number of private school students and teachers.
- The primary purpose of the Common Core of Data (CCD) is to provide basic information on public elementary and secondary schools, local education agencies (LEAs), and state education agencies (SEAs) for each state, the District of Columbia, and the outlying territories with a U.S. relationship.
- Total factor productivity – From Eli Dourado: “Total factor productivity captures how much output can be produced with a diverse but fixed basket of inputs. As technology and institutions improve, TFP goes up. As they deteriorate, it goes down. In the last decade, TFP has deeply stagnated. While there are numerous estimates of total factor productivity in the US, only the series maintained by the Federal Reserve Bank of San Francisco is quarterly and attempts to adjust for the business cycle. Since this series is not available in FRED, I am making it available here in graphical form.”
- The Atlanta Fed’s Wage Growth Tracker
State and local government data
- Correlates of State Policy | IPPSR – The Correlates of State Policy Project aims to compile, disseminate, and encourage the use of data relevant to U.S. state policy research, tracking policy differences across the 50 states and changes over time. We have gathered more than 900 variables from various sources and assembled them into one large, useful dataset. We hope this project will become a “one-stop shop” for academics, policy analysts, students, and researchers looking for variables germane to the study of state policies and politics. R package Shiny App
- The Fiscally Standardized Cities (FiSC) database allows users to create a custom table with fiscal information for selected cities. To create a table, select one or more cities, one or more years, and one or more fiscal variables. The default display options can be also adjusted, and users can choose whether to display data for FiSCs and/or one of the component governments (Cities, counties, school districts, and special districts).
- State expediture report – This annual report examines spending in the functional areas of state budgets: elementary and secondary education, higher education, public assistance, Medicaid, corrections, transportation, and all other. It also includes data on capital spending by program area, as well as information on general fund and transportation fund revenue collections.
- Annual Survey of State and Local Government Finances is the only source of nationwide, comprehensive local government finance information. It provides statistics on revenue, expenditure, debt, and assets for the 50 states and D.C.
- IPUMS – IPUMS provides census and survey data from around the world integrated across time and space. IPUMS integration and documentation makes it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community contexts.
- Eurostat – Eurostat is the statistical office of the European Union.
- NORC at U of Chicago – NORC experts conduct research in a wide range of subjects, bringing insight to topics including education, economics, global development, health, and public affairs. NORC is a solid alternative to Census and BLS surveys.
- Data USA – Maintained by Deloitte and Datawheel, DataUSA has a lot of databases. I use this site to grab data about cities and states.
- Damodaran’s finacial datasets | industry cap ex, risk premiums, etc – Aswath Damodaran is the GOAT: “Since I teach valuation and corporate finance, I am constantly collecting and analyzing data, and I have found that the data, once analyzed, can be used multiple times. Since I already have the processed data, I could not see any harm from sharing that data with others, thus saving us all some collective time, which we can spend far more productively not just on valuation but also with family and friends.”
- State expediture report – This annual report examines spending in the functional areas of state budgets: elementary and secondary education, higher education, public assistance, Medicaid, corrections, transportation, and all other. It also includes data on capital spending by program area, as well as information on general fund and transportation fund revenue collections.
Compendiums of data:
- Historical data library
- asdfree – An archive of the analyze survey data for free website, that provides step-by-step instructions to analyze major public-use survey data sets from the website that’s easy to type
- Cool Datasets – As they used to say, “A place to find cool datasets.” From Archive.org.
- Awesome Public Datasets – This list of topic-centric public data sources in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in sindresorhus’s awesome list.
- I³ Open Innovation Dataset Index – This is the web version of the I³ Open Dataset Index, a collection of innovation datasets, and related tools, platforms and resources used by the broader research community.
Telecom & tech datasets
- Ookla Open Datasets [source] [R library]
- Raw Form 477 data; See also the interactive map, which is much easier to use
- NTIA’s Indicators of Broadband Need Map
- Internet Access Service Report data - Data on Internet access connections in the United States as collected by FCC Form 477 twice a year.
- List of 40 specialized FCC databases, such as radio call signs and equipment authorization
- Measuring Fixed Broadband Data
- Measuring Broadband America Program - Fixed Reports Data (Whitebox data)
- IAD Data & Statistical Reports
- Digital Nation Data Explorer include raw data for the Internet and Computer Use Survey
- OECD Broadband statistics
- The Emergency Broadband Benefit tracker is supported by USAC. There is also more data here and here.
- USAC provides Lifeline participation program data as well as disbursement data.
- Early Release Reports on Wireless Substitution at the CDC
- To research the FCC’s Electronic Comment Filing System (ECFS), you can conduct a (Proceeding Search) or a (Filing Search). It is also important to keep track of the FCC actions and you can do that through both their blog and their headlines.
- FCC Auctions: All Auctions
- Cybertelecom
- Federal Register // FCC
- Comms Act
- Download Data - National Broadband Map
- National Broadband Map Datasets | BroadbandUSA - NTIA
- FCC Form 477 Data
- Internet Access Services Reports | Federal Communications Commission
- Reports | Federal Communications Commission
- Reports | Federal Communications Commission
- H.R.1555 - 104th Congress (1995-1996): Communications Act of 1995 | Congress.gov | Library of Congress
- S.652 - 104th Congress (1995-1996): Telecommunications Act of 1996 | Congress.gov | Library of Congress
- Ookla Open Datasets [source]
- My continuing research on broadband will live in this repo. Broadband adoption and use analysis
- Michael Kotrous put all of his data in this repo, which supports his academic work in broadband. Highly suggested.
- Raw Form 477 data; See also the interactive map, which is much easier to use
- Internet Access Service Report data - Data on Internet access connections in the United States as collected by FCC Form 477 twice a year.
- List of 40 specialized FCC databases, such as radio call signs and equipment authorization
- Measuring Fixed Broadband Data
- Fixed Reports Data – Commonly called Whitebox data.
- FCC IAD Data & Statistical Reports – The Office of Economics and Analytics Industry Analysis Division releases periodically the reports and data listed below. Much of this information is based on forms that service providers file with the FCC.
- NTIA’s Digital Nation Data Explorer include raw data for the Internet and Computer Use Survey
- OECD Broadband statistics
Other datasets
- Vertical Farming - [link]
- Twitterstream from Archive. A simple collection of JSON grabbed from the general twitter stream, for the purposes of research, history, testing and memory. This is the “Spritzer” version, the most light and shallow of Twitter grabs. Unfortunately, we do not currently have access to the Sprinkler or Garden Hose versions of the stream. [Archive]
- The California Forest Observatory is a data-driven forest monitoring system that maps wildfire hazard drivers across California, including forest structure, weather, topography, and infrastructure.
- OpenStreetMap provides a broad range of map data maintained by a worldwide community of geographers and cartographers.
- The Registry of Open Data on AWS has empowered laboratories, research institutions, and various other organizations to deliver open datasets to developers, startups, and enterprises worldwide since its launch in 2018.
- Nasa Earth Observations offer climate and environmental data for the globe. You can browse and download the satellite data from NASA’s constellation of Earth Observing System satellites. Over 50 different global datasets are represented with daily, weekly, and monthly images available in various formats.
- A Google BigQuery public dataset is any dataset made available to the general public through the Google Cloud Public Dataset Program.
- Koordinates is an emerging geospatial data management platform where you can host, manage, share, publish, and access geodata.
- Natural Earth is a collection of public domain map datasets available in vector or raster formats and various scales.
- Safegraph offers some open census data and neighborhood demographics.
- The Canadian government has its own Open Data Portal.
- An open dataset of electric vehicles and their specs.
- Open Zone Map - the largest dataset and only interactive map of Special Economic Zones.
Legal and legislative research
- Legal News & Analysis on Litigation, Policy, Deals : Law360
- Lexology
- GovTrack.us: Tracking the U.S. Congress
- NCSL 50-State Searchable Bill Tracking Databases
- LegiScan | Bringing People to the Process
- USA Spending
- Transcripts of Congressional Hearings
- Reports from the Rural Electrification Administration
- Every CRS report
- Oversight.gov is a publicly accessible, searchable website containing the latest public reports from Federal Inspectors General who are members of the Council of the Inspectors General on Integrity and Efficiency.
- The Federal Trade Commission (FTC) is mainly a policing agency, and thus doesn’t have the rulemaking authority of the FCC, so the best way to keep informed is to follow their blog and their news and events.
- Committees of Jurisdiction:
- House of Representatives: Energy & Commerce Committee; Communication and Technology Subcommittee; Judiciary Committee; The Subcommittee on Courts, Intellectual Property, and the Internet
- Senate: Committee on Commerce, Science, and Transportation; Communications, Technology, and the Internet Subcommittee; Judiciary Committee; Privacy, Technology and the Law Subcommittee; Subcommittee on Antitrust, Competition Policy and Consumer Rights Subcommittee
- The Unified Agenda – “The Biden Administration’s Unified Agenda of Regulatory and Deregulatory Actions (Agenda) reports on the actions administrative agencies plan to issue in the near and long term. Released by the Office of Information and Regulatory Affairs, the Agenda provides important public notice and transparency about proposed regulatory and deregulatory actions within the Executive Branch.”
- The Plum Book – “The U.S. Government Publishing Office (GPO) has made the United States Policy and Supporting Positions, or the “The Plum Book”, available in print, online and as an app. The publication lists more than 9,000 Federal civil service leadership and support positions in the executive and legislative branches that may be subject to noncompetitive appointment. First introduced in 2012, The Plum Book app allows users to view the publication in an easy to use mobile format and search the publication by agency, position title, location, appointment type, pay plan and level, tenure and term expiration, and vacancy.”
- Open States: discover politics in your state - Open States
- ICT Reg Toolkit
- Federal Register :: Document Search
- Reg Body of Knowledge – Developed by the Public Utility Research Center (PURC) at the University of Florida, in collaboration with the University of Toulouse, the Pontificia Universidad Catolica, the World Bank and a panel of international experts, the Body of Knowledge on Infrastructure Regulation (BoKIR) summarizes some of the best thinking on infrastructure policy.
General research
- PhilPapers: Online Research in Philosophy
- Oral History Collection | Computer History Museum - CHM’s oral history program records and preserves the firsthand recollections of computing pioneers and innovators from around the world. With over a thousand in-depth interviews and demonstrations, CHM’s oral history collection is one of the most comprehensive on the subjects of computing, technological innovation, web, entrepreneurship, and networking. Also just Computer History Museum
- The Winnower | Open Scholarly Publishing
- Machine Learning and Artificial Intelligence
- AITopics
- Wikisum: Summaries of political science research [link]
- Internet Sacred Text Archive Home
- HathiTrust Digital Library
- Glossery of municipal securities terms
- Use cheatography.com to find cheat sheets on every subject.
- Classics in the History of Psychology
- ORBIS - ORBIS allows us to express Roman communication costs in terms of both time and expense. By simulating movement along the principal routes of the Roman road network, the main navigable rivers, and hundreds of sea routes in the Mediterranean, Black Sea and coastal Atlantic, this interactive model reconstructs the duration and financial cost of travel in antiquity.
- “A Layman’s Guide to Recreational Mathematics Videos” by Sam Enright
Data science tools and resources
- The R Book
- Jupyter Notebook Viewer
- Learn python3 in Y Minutes
- A gallery of interesting Jupyter and IPython Notebooks - jupyter/jupyter Wiki
- data-wrangling-cheatsheet
- Cheatsheet for dplyr join functions
- Using R for Time Series Analysis
- R Resources (Cheatsheets, Tutorials, & Books)
- ggplot2 histogram plot : Quick start guide - R software and data visualization - Easy Guides - Wiki - STHDA
- How to Visualize and Compare Distributions | FlowingData
- R Programming/Probability Distributions - Wikibooks, open books for an open world
- Quick-R: Regression Diagnostics
- Quick-R: Multiple Regression
- House Prices: Advanced Regression Techniques | Kaggle
- How to Make High Quality Data Visualizations for Websites With R and ggplot2
- Machine Learning, Data Science, Big Data, Analytics, AI
- Common statistical tests are linear models (or: how to teach stats)
- Chris Albon’s data science reading list [Archive]
- Khan Academy economics
- On Computational Thinking, Inferential Thinking and Data Science [YouTube]
- When Should I Use Confidence Intervals, Prediction Intervals, and Tolerance Intervals [Minitab]
- A Semi-Supervised Classification Algorithm using Markov Chain and Random Walk in R [Sandipanweb]
- Distributed K-Means with R-Hadoop [Sandipanweb]
- mlshowcase – machine leaning showcase
- Detailed Exploratory Data Analysis using R [Kaggle]
- Comprehensive data exploration with Python [Kaggle]
- NeuralNetworkLawfirmNames – Using torch, generated list of law firms, trained using Law360’s Top 400 law firms by size [Github]
- Python word2vec [link]
- GeoDa [link]
- A catalog of resources for learning ML keep an eye on this
Rhetoric
- Analysis: To get started with rhetorical analysis, start with “Basic questions answered by a rhetorical analysis” and then read the entirety of “A guide to the Toulmin Method.” Also: A short handbook on rhetorical analysis [Archive]; Rhetoric and composition resources [Archive]
- The Forest of Rhetoric [silva rhetoricae] – This online rhetoric, provided by Dr. Gideon Burton of Brigham Young University, is a guide to the terms of classical and renaissance rhetoric.
- [Aesthetics Wiki] is a comprehensive encyclopedia of online and offline aesthetics! We are an online community dedicated to the identification, observation, and documentation of visual schema. … Enjoy your stay!
- Classics: The Internet Classics Archive | Perseus Digital Library | “Shakespeare’s debt to Ovid” by Jeremy McNamara, The 1992-93 Fox Classics Lecture | William Whitaker’s Words | The Internet Classics Archive | LacusCurtius - Greek and Latin Texts
- Guide to Grammar and Style by Jack Lynch
- Common mistakes of grammar
- (f)ragments is a scholar’s site devoted to advanced research in rhetorical and communication theory. [Archive]
- Quotes to ‘Re-Imagine’ Schools for the 21stC [Archive]
- Heresthetics
Miscellaneous lectures, notes, + resources
- Free Text to Speech Online with Natural Voices
- Paste to Markdown
- CPI Inflation Calculator
- Docx2LaTeX Automatic converter Dashboard
- Real time satellite tracking [link]
- Mapwarper is a free to use, open source map warper / map georectifier, and image georeferencer tool for individuals and small groups.
- Hemingway Editor
- Five Books
- Find A Spring – Find A Spring is a user-generated map of cold and hot springs. We provide locations, photos, reviews, water test results, comments and more.
- Through the Great 78 Project the Internet Archive has begun to digitize 78rpm discs for preservation, research, and discovery with the help of George Blood, L.P.. 78s were mostly made from shellac, i.e., beetle resin, and were the brittle predecessors to the LP (microgroove) era. [link]
- Sharkle! - Generator of random awesomeness http://sharkle.com
- IEEE list of milestones
- Washington DC websites:
- A collection of several hundred online tools for OSINT | Hacker News
- U Penn’s Online Books Page – Listing over 3 million free books on the Web
- History: A summation of all knowledge
- The Best Textbooks on Every Subject from Less Wrong
- Introduction to Astronomy
- The General Index : Public Resource : Free Download, Borrow, and Streaming : Internet Archive
- Sciencemadness library
- Physics of Information