.. _l-pydata2016: PyData 06/14/2016 in Paris ========================== .. sharenet:: :facebook: 1 :linkedin: 2 :twitter: 3 :head: False Content +++++++ **Main presentation** * :ref:`10plottinglibrariesrst` A couple of notebooks require to be run to see the results because a naive conversion does not take into account javascript dependencies (pythreejs, vega, brython) or does not work at all because it involves a server (bqplot). **Static libraries** * :ref:`imcartopyrst` * :ref:`imbiopythonrst` * :ref:`imete3rst` * :ref:`imlifelinesrst` * :ref:`immatplotlibrst` * :ref:`immissingnorst` * :ref:`immplscatterdensityrst` * :ref:`imnetworkxrst` * :ref:`implotninerst` * :ref:`imreportlabrst` * :ref:`imscikitplotrst` * :ref:`imseabornrst` **Interactive libraries** * :ref:`jsbokehrst` * :ref:`jslightningpythonrst` * :ref:`jsmpld3rst` * :ref:`jsplotlyrst` * :ref:`jspydymassspringdamperrst` * :ref:`jspyechartsrst` * :ref:`jspygalrst` * :ref:`jspythreejsrst` * :ref:`jsvegarst` **Pure javascript** * :ref:`jsonlytreantrst` **Big Data** * :ref:`bigdatashaderrst` **Mix between Python and Javascript** * :ref:`pyjsbqplotrst` * :ref:`pyjsbrythonrst` * :ref:`pyjscvispyrst` **Not covered by this presentation** * `altair `_: an example on how to wrap `Vega `_ in a more Pythonic way * `blockdiag `_: interesting alternative to :epkg:`Graphviz` * `flexx `_ : very promising way to plug javascript graphs written in Python, this module contains a tools which converts Python into javascript * `graphviz `_: famous library to draw graph, trees. I skipped because all the wrappers are not self contained and require to install `graphviz `_ first. * `HoloViews `_: useful to build complex and linked graphs, look at `Pandas Conversion `_ * `ipyleaflet `_: offers similar tools than `folium `_, see an example on how to interact with `bqplot `_: `An Analysis of Well-Being in San Francisco `_ * `kartograph `_ : maps * `nglview `_: animate molecular structures * `pandastable `_ : IDE to look at dataframes * `python-gantt `_ : to draw `Gantt charts `_ * `pytraj `_: analyze of molecular dynamics trajectories and displays * `pyxley `_: web app on Flask * `toyplot `_: PDF, SVG, MP4 rendering * `vaex `_: the speaker just after me and the library is able to cope with big data at a very high scale **Dig into building a Jupyter extension** * `Js extensions `_ * `Distributing Jupyter Extensions as Python Packages `_ * `Notebook extensions `_ *Links* * `The Python Visualization Landscape `_ .. _l-pydataparis-notes: From others presentations +++++++++++++++++++++++++ The presentation which follows showed how to use `d3.js `_. It was amazingly easy and understandable: *Building Visualisations in d3.js for Python Programmers* by Thomas Parslow. The talk on `software-carpentry `_ was also quite interesting as they developed strong experience in animating workshop. Surprising to see so many initiatives to educate people on programming. Some links taken from presentations: * `pyspark-ide-starter `_: setup for Spark * `From scikit-learn to Spark ML `_: tutorial to switch from *scikit-learn* to *pyspark* The presentation by `Nexedis `_ was quite impressive. They introduced their stack to process data mostly based on open source projects: * `Fluentd `_: a software which collects and sends data from your laptop. Acccording to the speaker (Jean-Paul Smets), it loses 1 byte out of 10 millions, even if you close your laptop at anytime. * `Re6st `_: Resilient, Scalable, IPv6 Network, find routes between two locations in Internet. According to the speaker, it is much more reliable than standard routing which always takes the same paths. It is like taking small roads instead of highways. * `neoppod `_: NEO is a distributed, redundant and scalable implementation of ZODB API. NEO stands for Nexedi Enterprise Object. * `Erp5 `_: written in Python, see `Python Success Stories `_. ERP5 is a full featured high end Open Source / Libre Software solution published under GPL license and used for mission critical ERP / CRM / MRP / SCM / PDM applications by industrial organisations and government agencies. * `SlapOS `_: SlapOS is a decentralized Cloud Computing technology that can automate the deployment and configuration of applications in a heterogeneous environment. * `MariaDB `_: MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance. * `wendelin.core `_ Out-of-core NumPy arrays. `ZBigArray `_ can cope with any size of data from any container (memory, file, data base, ...) and should work with `sikit-learn `_ (to be continued). The most interesting part of the talk was about the way the company decided to base their processes on a particular libraries, especially for *Fluentd*. No connection but I heard the following in presentations. Github added a new features which allows users to edit directly from the browser. It is very useful to fix typos and documentation: `Editing files in your repository `_. Somebody would to add `Functional PCA `_ to *scikit-learn*. `thebe `_ is a javascript libraries which makes it easy to call a server to run Python code from a web page, kind of simplified notebook to build documentation (`source `_). A paper: `Sparse pairwise Markov model learning for anomaly detection in heterogeneous data `_. The MIT proposes a pretrained CNN (Convolution Neural Network) for places: * `Places CNN `_, `Pre-release of Places365-CNNs `_