PyData 06/14/2016 in Paris
Content
Main presentation
A couple of notebooks need to be run to show their results: a naive conversion either ignores their javascript dependencies (pythreejs, vega, brython) or does not work at all because the library needs a server (bqplot).
Static libraries
Interactive libraries
Pure javascript
Big Data
Mix between Python and Javascript
Not covered by this presentation
altair: an example of how to wrap Vega in a more Pythonic way
flexx: a very promising way to plug in javascript graphs written in Python; the module contains a tool which converts Python into javascript
graphviz: famous library to draw graphs and trees. I skipped it because none of the wrappers is self-contained; they all require installing graphviz first.
HoloViews: useful to build complex and linked graphs, look at Pandas Conversion
ipyleaflet: offers tools similar to folium; see an example of how it interacts with bqplot: An Analysis of Well-Being in San Francisco
kartograph: maps
nglview: animate molecular structures
pandastable: IDE to look at dataframes
python-gantt: to draw Gantt charts
pytraj: analysis and display of molecular dynamics trajectories
pyxley: web app on Flask
toyplot: PDF, SVG, MP4 rendering
vaex: presented by the speaker just after me; the library copes with big data at a very large scale
Dig into building a Jupyter extension
Links
From other presentations
The presentation that followed showed how to use d3.js. It was amazingly easy and understandable: Building Visualisations in d3.js for Python Programmers by Thomas Parslow.
The talk on software-carpentry was also quite interesting, as they have built up strong experience in running workshops. It was surprising to see so many initiatives to teach people programming.
Some links taken from presentations:
pyspark-ide-starter: setup for Spark
From scikit-learn to Spark ML: tutorial to switch from scikit-learn to pyspark
The presentation by Nexedi was quite impressive. They introduced their data processing stack, which is mostly based on open source projects:
Fluentd: software which collects and sends data from your laptop. According to the speaker (Jean-Paul Smets), it loses 1 byte out of 10 million, even if you close your laptop at any time.
Re6st: Resilient, Scalable, IPv6 Network; it finds routes between two locations on the Internet. According to the speaker, it is much more reliable than standard routing, which always takes the same paths. It is like taking small roads instead of highways.
neoppod: NEO is a distributed, redundant and scalable implementation of ZODB API. NEO stands for Nexedi Enterprise Object.
Erp5: written in Python, see Python Success Stories. ERP5 is a full featured high end Open Source / Libre Software solution published under GPL license and used for mission critical ERP / CRM / MRP / SCM / PDM applications by industrial organisations and government agencies.
SlapOS: SlapOS is a decentralized Cloud Computing technology that can automate the deployment and configuration of applications in a heterogeneous environment.
MariaDB: MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.
wendelin.core: out-of-core NumPy arrays. ZBigArray can cope with any size of data from any container (memory, file, database, …) and should work with scikit-learn (to be continued).
The most interesting part of the talk was about how the company decided to base its processes on particular libraries, especially Fluentd.
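To make the out-of-core idea behind wendelin.core more concrete, here is a minimal sketch that uses only the Python standard library. It does not use wendelin.core or ZBigArray, whose API is not shown in these notes. The helper names (`write_doubles`, `chunked_sum`) and the raw float64 file layout are assumptions made for illustration. The file is memory-mapped and read through an array-like view, so only the chunk being processed needs to be in memory.

```python
import array
import mmap
import os
import tempfile


def write_doubles(path, count, chunk=4096):
    """Write the values 0.0 .. count-1 to `path` as raw float64 (hypothetical layout)."""
    with open(path, "wb") as f:
        for start in range(0, count, chunk):
            stop = min(start + chunk, count)
            # Only one chunk of values is built in memory at a time.
            array.array("d", (float(i) for i in range(start, stop))).tofile(f)


def chunked_sum(path, chunk=4096):
    """Sum a non-empty file of float64 values without loading it all at once."""
    total = 0.0
    with open(path, "rb") as f, \
            mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
        # View the mapped bytes as an array of doubles; pages are read lazily.
        with memoryview(mm) as raw, raw.cast("d") as values:
            for start in range(0, len(values), chunk):
                total += sum(values[start:start + chunk])
    return total


if __name__ == "__main__":
    demo_path = os.path.join(tempfile.mkdtemp(), "values.bin")
    write_doubles(demo_path, 10000)
    print(chunked_sum(demo_path))
```

A library such as wendelin.core goes much further (other storage back ends, NumPy compatibility); the sketch only shows why an array does not have to fit in memory to be processed.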
Unrelated to the above, I also heard the following in presentations:
GitHub added a new feature which allows users to edit files directly from the browser. It is very useful to fix typos and documentation: Editing files in your repository.
Somebody would like to add Functional PCA to scikit-learn.
thebe is a javascript library which makes it easy to call a server to run Python code from a web page, a kind of simplified notebook to build documentation (source).
A paper: Sparse pairwise Markov model learning for anomaly detection in heterogeneous data.
MIT offers a pretrained CNN (Convolutional Neural Network) for places: