AbrLiu/papers

papers

good good study, day day up

Topic Reviews

Distributed System

Theorem

  • Brewer’s Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services | pdf | notes
  • BASE: An Acid Alternative | pdf | notes
  • Eventually consistent | pdf
  • Conflict-free Replicated Data Types | pdf
  • The Byzantine Generals Problem | pdf
  • The Part-Time Parliament | pdf
  • Paxos Made Simple | pdf

Storage

  • Bitcask - A Log-Structured Hash Table for Fast Key/Value Data | pdf | notes
  • The Google File System | pdf | notes
  • Bigtable: A Distributed Storage System for Structured Data | pdf
  • The Chubby lock service for loosely-coupled distributed systems | pdf

Processing Model

  • MapReduce: Simplified Data Processing on Large Clusters | pdf | notes

Others

  • Web search for a planet: The Google cluster architecture | pdf

NLP

Basic Models

  • Speech and Language Processing | pdf
  • A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text | pdf
  • A Statistical Approach to Machine Translation | pdf
  • Spam Filtering with Naive Bayes – Which Naive Bayes? | pdf

Undirected Graphical Model

alias: Markov network, Markov Random Field (MRF)

HMM, MEMM, CRF

  • Introduction to Hidden Markov Model | pdf | Chap. 8, Part-of-Speech Tagging
  • Conditional Random Fields - Probabilistic Models for Segmenting and Labeling Sequence Data | pdf

Near-duplicate Detection

  • N-gram Similarity and Distance | pdf
  • A Closer Look at Skip-gram Modelling | pdf
  • Similarity Estimation Techniques from Rounding Algorithms | pdf
  • Detecting Near-Duplicates for Web Crawling | pdf

Graphs

  • Finding and evaluating community structure in networks | pdf
  • Fast algorithm for detecting community structure in networks | pdf
