Non classé – Scikit-Learn Consortium

How are the priorities of the consortium defined?

The scikit-learn Consortium @ Inria defines a roadmap every six to eight months during its Technical Committee. Previous roadmaps are available here. Why a roadmap? The members of the Consortium provide their financial support without any service counterpart. The definition of a development and general activities roadmap is an important step in building trust between the members of the Consortium. It represents our effort to focusing together on…

22May2021

Generalized Linear Models have landed in scikit-learn

While scikit-learn already had some Generalized Linear Models (GLM) implemented, e.g. LogisticRegression, other losses than mean squared error and log-loss were missing. As the world is almost (surely) never normally distributed, regression tasks might benefit a lot from the new PoissonRegressor, GammaRegressor and TweedieRegressor estimators: using those GLMs for positive, skewed data is much more appropriate than ordinary least squares and might lead to more adequate models. Starting…

24Feb2021

Advisory Committee / February 8th 2021

Presentation of the activities of the Consortium during the last year (C. Marmo, G. Varoquaux): Questions and comments: Fujitsu Fujitsu actively participates in the Consortium remote events. Fujitsu would be glad to increase Japan contributions to scikit-learn. Fujitsu suggests organizing a sprint for Japan time zone, and starting a discussion about good practices to organize online sprints with the team there. Microsoft More information about the MOOC are…

15Feb2021

Implementing a faster KMeans in scikit-learn 0.23

The 0.23 version of scikit-learn was released a few days ago, bringing new features, bug fixes and optimizations. In this post we will focus on the rework of KMeans, a long going work started almost two years ago. Better scalability on machines with many cores was the main objective of this journey. It forced us to touch core challenges of low-level parallelism. KMeans clustering Before describing the optimization…

26May2020

Time to come out! scikit-learn 0.22

A new look and many new features for this 0.22 scikit-learn release. Just a bit earlier than Santa visiting, this past month some special Elves have worked really hard to keep the target of releasing scikit-learn twice a year. Come take a look at some of the many surprises this remarkable package contains. With big data come big responsibilities New features for plotting and interpretability Models fitted by…

6Dec2019

Fujitsu joins the Consortium

Fujitsu Laboratories join the Consortium. Fujitsu will thereby contribute to the sustainability of the scikit-learn development community. More information is available via the press releases published by Inria and Fujitsu.

11Nov2019

May 28, Tuesday: the first workshop of the consortium

All scikit-learn consortium partners are pleased to invite you to the first annual workshop Tuesday, May…

19Apr2019

Scikit-learn sprint in Paris

Three weeks ago, we organized a scikit-learn sprint in the AXA’s offices in Paris. No less than 37 persons attended the sprint during the week. Such effort is equivalent to a 6 man-month! While the sprint was organized by the scikit-learn fondation @ inria, it united a much wider group of contributors and it was funded by other organizations (see below). Improvements to scikit-learn This sprint saw the…

2Apr2019

ADVISORY COMMITTEE / NOVEMBER 8, 2018

Minutes of the scikit-learn Advisory Committee meeting (November 8th 2018) Where present: Chaouki Boutharouite - AXA Sébastien Conort - BNP Paribas Cardif Sylvain Duranton - BCG David Margery - Inria Foundation Olivier Trébucq - Inria Gaël Varoquaux - Inria Excused: Josh Patterson and Guillaume Barat - Nvidia Laurent Duhem - Intel 1- The current administrative and financial situation of the consortium Two engineers have already been recruited…

30Nov2018

TECHNICAL COMMITTEE / October 10, 2018

Technical Committee October 10, 2018 Priority list for the consortium at Inria, year 2018–2019 From the points discussed at the meeting, the Technical Committee is proposing the following list of priorities for the actions of the consortium, to be used by the management team for allocating consortium resources: Faster release cycle & dedicate resources for maintenance. Benchmark and compliance tests (Intel & Nvidia): scikit-learn-benchmarks →…

30Nov2018