Skip to content

Commit

Permalink
detail change
Browse files Browse the repository at this point in the history
  • Loading branch information
Victor0118 committed Dec 16, 2017
1 parent fb3bff3 commit 214769b
Showing 1 changed file with 7 additions and 2 deletions.
9 changes: 7 additions & 2 deletions paper/largelsh.tex
Original file line number Diff line number Diff line change
Expand Up @@ -562,15 +562,20 @@ \section{Summary}
both model performance (accuracy) and run time needed to attain the best accuracy, while at the same
time requiring less parameters to tune. Both LSH and spill trees can horizontally scale. To the best of
our knowledge, we are not aware of any exiting comparisons between spill trees and LSH for these
kinds of tasks using a modern parallel data processing framework. Despite the fact the LSH doesn't
perform as well as spill trees, there are still several selling points of LSH. Although having a large
kinds of tasks using a modern parallel data processing framework.
We notice that LSH doesn't
perform as well as spill trees in our experiment on SVHN dataset.
We argue that it might because there are some parameter tuning and optimization issues with the Apache Spark API.
Although having a large
number of tunable parameter maybe frustrating, at the same time it offers more flexibility in terms of
the accuracy vs. run time tradeoff. If we really needed some predictions with low latency while not
caring about accuracy as much LSH gives us that flexibility. Furthermore, LSH is an elegant idea that is
much simpler to understand on the conceptual level compared with spill trees and can serve as a good
baseline for these kinds of tasks in the future over the brute-force method, which can be infeasible to
compute.

In the future, we will try to optimize our implementation to allow it to better cooperate with the MapReduce fashion. Other tasks and better hard-ware computing resource are also worth considering to get a deep understanding on how approximate search works in nature.

\section*{Acknowledgement}
We thank Professor Samer Al-Kiswany for helpful discussions regarding the direction
of the project. We thank Microsoft for graciously providing Azure credits for
Expand Down

0 comments on commit 214769b

Please sign in to comment.