To do: watch the new talk and write a summary.
Noah Smith: squash network
Main points:
Difference between LSA and SVD
Bayesian graphical models
Informative priors are useful in the model
Bayesian network: a DAG over X1, X2, …, Xn with joint distribution P(X1, X2, …, Xn)
Generative story: HMM (dependencies)
A and B are conditionally independent given C iff P(A,B|C) = P(A|C) * P(B|C)
… Read more: Lecture 5: Reduced-dimensionality representations for documents: Gibbs sampling and topic models
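The conditional-independence definition in this excerpt can be checked numerically on a small joint table. The following is only a sketch under assumptions that are not in the lecture notes: all three variables are binary, the joint is stored as `joint[a][b][c]`, and the names `Joint` and `conditionallyIndependent` are made up for illustration.

```cpp
#include <array>
#include <cassert>
#include <cmath>

// Joint distribution over three binary variables, indexed as joint[a][b][c].
// (Hypothetical layout chosen for this sketch.)
using Joint = std::array<std::array<std::array<double, 2>, 2>, 2>;

// Returns true when P(A,B|C) == P(A|C) * P(B|C) for every a, b, c (within tol).
bool conditionallyIndependent(const Joint& joint, double tol = 1e-9) {
    assert(tol >= 0.0);
    for (int c = 0; c < 2; ++c) {
        // P(C = c), obtained by summing out A and B.
        double pc = 0.0;
        for (int a = 0; a < 2; ++a)
            for (int b = 0; b < 2; ++b)
                pc += joint[a][b][c];
        if (pc <= 0.0) continue;  // conditioning on a zero-probability event is undefined; skip it

        for (int a = 0; a < 2; ++a) {
            for (int b = 0; b < 2; ++b) {
                double pac = joint[a][0][c] + joint[a][1][c];  // P(A=a, C=c)
                double pbc = joint[0][b][c] + joint[1][b][c];  // P(B=b, C=c)
                double lhs = joint[a][b][c] / pc;              // P(a, b | c)
                double rhs = (pac / pc) * (pbc / pc);          // P(a | c) * P(b | c)
                if (std::fabs(lhs - rhs) > tol) return false;
            }
        }
    }
    return true;
}
```

A table built as P(c) * P(a|c) * P(b|c) should pass the check, while a table in which A always equals B (for a C value with nonzero probability) should fail it.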
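GLuint posLength = sizeof(PointStruct) * PointCloudData.size();  // correct: bytes per point times number of points
GLuint posLength = sizeof(PointCloudData);  // wrong: size of the container object itself, not of the points it holds
glBufferData(GL_ARRAY_BUFFER, posLength, PointCloudData.data(), GL_STATIC_DRAW);  // pass .data(), not &PointCloudData, assuming PointCloudData is a std::vector<PointStruct>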
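The difference between the two size computations in the snippet above can be seen without any OpenGL context. This is a minimal sketch that assumes `PointCloudData` is a `std::vector<PointStruct>`; the three-float `PointStruct` and the helper names `payloadBytes` and `containerBytes` are stand-ins invented here, not the post's actual definitions.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the post's PointStruct: three floats per point.
struct PointStruct {
    float x, y, z;
};

// Number of bytes occupied by the vector's elements,
// i.e. the size a buffer upload of all points needs.
std::size_t payloadBytes(const std::vector<PointStruct>& points) {
    return sizeof(PointStruct) * points.size();
}

// Size of the std::vector object itself (its internal bookkeeping).
// This does not change with the number of elements stored.
std::size_t containerBytes(const std::vector<PointStruct>& points) {
    return sizeof(points);
}
```

For vectors of 10 and 1000 points, `payloadBytes` grows with the element count while `containerBytes` returns the same small value for both, which is why `sizeof(PointCloudData)` uploads the wrong number of bytes.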
Cross entropy
H(p,q) = D(p||q) + H(p)
H(p) is the inherent randomness in p. D(p||q) is what we care about, and we can get at it by calculating the cross entropy: H(p) does not depend on q, so lowering H(p,q) lowers D(p||q) by the same amount.
Conclusion: a model is good if it assigns a good approximation to the observed data, so we need to find a good q.
Main points: … Read more: Hello
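The identity H(p,q) = D(p||q) + H(p) from this excerpt can be verified numerically. The sketch below assumes discrete distributions stored as vectors of probabilities and uses base-2 logarithms; the function names are chosen here for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Entropy H(p) in bits; terms with zero probability contribute nothing.
double entropy(const std::vector<double>& p) {
    double h = 0.0;
    for (double px : p)
        if (px > 0.0) h -= px * std::log2(px);
    return h;
}

// Cross entropy H(p, q) in bits. Assumes q[i] > 0 wherever p[i] > 0.
double crossEntropy(const std::vector<double>& p, const std::vector<double>& q) {
    assert(p.size() == q.size());
    double h = 0.0;
    for (std::size_t i = 0; i < p.size(); ++i)
        if (p[i] > 0.0) h -= p[i] * std::log2(q[i]);
    return h;
}

// KL divergence D(p || q) in bits, under the same assumption on q.
double klDivergence(const std::vector<double>& p, const std::vector<double>& q) {
    assert(p.size() == q.size());
    double d = 0.0;
    for (std::size_t i = 0; i < p.size(); ++i)
        if (p[i] > 0.0) d += p[i] * std::log2(p[i] / q[i]);
    return d;
}
```

With the made-up distributions p = (0.5, 0.25, 0.25) and q = (0.25, 0.25, 0.5), the entropy is 1.5 bits, the cross entropy is 1.75 bits, and the divergence is 0.25 bits, so the two sides of the identity agree.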
Today’s class is about hypothesis testing, collocations, and info theory.
Hypothesis testing
Last lecture covered the methodology.
Collocation
“religion war”
PMI, PPMI
PMI = pointwise mutual information
PMI = log2(P(x,y)/(P(x)P(y))) = I(x,y)
PPMI = positive PMI = max(0, PMI)
Example: Hong Kong. The frequencies of “Hong” and “Kong” are low, but the frequency of “Hong Kong” … Read more: Lecture 3: Information Theory
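The PMI and PPMI definitions in this excerpt translate directly into code. This is a small sketch that takes the three probabilities as plain numbers; estimating them from corpus counts is left out, and the function names are chosen here for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// PMI = log2( P(x,y) / (P(x) * P(y)) ), in bits.
// All three probabilities must be positive for the logarithm to be defined.
double pmi(double pxy, double px, double py) {
    assert(pxy > 0.0 && px > 0.0 && py > 0.0);
    return std::log2(pxy / (px * py));
}

// PPMI = max(0, PMI): negative associations are clipped to zero.
double ppmi(double pxy, double px, double py) {
    return std::max(0.0, pmi(pxy, px, py));
}
```

With made-up numbers in the spirit of the Hong Kong example, two rare words that almost always appear together, say P(x) = P(y) = P(x,y) = 0.001, give a PMI of log2(1000), about 9.97 bits. Two words that co-occur exactly as often as chance predicts give a PMI of 0.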
Basically, the time spent on testing (inference) depends on:
the complexity of the neural network. For example, the fastest network should be a fully-connected network, and a CNN should be faster than an LSTM because an LSTM processes its input sequentially (sequential = slow). Currently, there are many ways to compress a deep learning model (for example, pruning: removing nodes with small weights).
the complexity of … Read more: The test speed of neural network?