Telemetry/Custom analysis with spark: Difference between revisions

Show how to distribute a repo
(Fix broken link to Telemetry Hello World notebook)
(Show how to distribute a repo)
Line 178: Line 178:


2. Put %%capture at the beginning of the cell to store all output. [https://ipython.org/ipython-doc/3/interactive/magics.html#cellmagic-capture See the documentation].
=== How do I load an external library into the cluster? ===
Assuming you have a URL for the repo, you can build an egg from it and distribute that egg to the cluster this way:
  !git clone <repo url> && cd <repo-name> && python setup.py bdist_egg
  sc.addPyFile('<repo-name>/dist/my-egg-file.egg')
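The egg's file name depends on the package name, version and Python version, so it can be easier to look it up than to type it. The following is only a sketch, assuming the repo has already been cloned and built as above and that exactly one egg sits in its dist directory. The helper names find_built_egg and distribute_egg are made up for this example; the only Spark call used is sc.addPyFile.

```python
import glob
import os


def find_built_egg(repo_dir):
    """Return the path of the single .egg file under <repo_dir>/dist.

    Hypothetical helper: assumes `python setup.py bdist_egg` has
    already been run inside repo_dir.
    """
    eggs = sorted(glob.glob(os.path.join(repo_dir, "dist", "*.egg")))
    if len(eggs) != 1:
        raise RuntimeError(
            "expected exactly one egg in %s, found %d"
            % (os.path.join(repo_dir, "dist"), len(eggs))
        )
    return eggs[0]


def distribute_egg(spark_context, repo_dir):
    """Ship the built egg to the workers and return the path that was used.

    spark_context is expected to be the notebook's `sc`; only its
    addPyFile method is called.
    """
    egg_path = find_built_egg(repo_dir)
    spark_context.addPyFile(egg_path)
    return egg_path
```

In a notebook cell this would be called as distribute_egg(sc, '<repo-name>'). After that, the package should be importable inside functions that run on the workers.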