生成式AI领域的最新成果都在这里!抢 QCon 展区门票 了解详情
写点什么

GitHub 大规模采用机器学习的痛点和破解之道

  • 2020-02-08
  • 本文字数:993 字

    阅读完需:约 3 分钟

GitHub 大规模采用机器学习的痛点和破解之道

ArchSummit 北京 2019 大会上,Jose David Baena 讲师做了《GitHub 大规模采用机器学习的痛点和破解之道》主题演讲,主要内容如下。


演讲简介


Title: Adopting Machine Learning at Scale


Scaling up machine-learning (ML), data retrieval and reasoning algorithms from Artificial Intelligence (AI) for massive datasets is a major technical challenge in our time. The scaling process can also have different dimensions: performance, development productivity, number of employees…


In this talk I will showcase how we used to develop Machine learning features at GitHub, the pain points we had and how we changed our infrastructure and way of development in order to productionize multiple ML features in terms of hours/days.


In addition, I will explore with the audience the main factors I consider when scaling ML at medium to big companies.


By the end of the talk you should have an overview and applicable framework on how to help scaling ML processes in your company.


Talk outline


Potential outline for the talk:


  • Introduction to ML at GitHub.

  • Challenges of running ML at scale. Different dimensions:

  • Performance: number of requests

  • Development: growing infrastructure, number of ML features

  • Organizational: number of employees

  • ML ecosystem architecture.

  • Improving agility and development on ML features.

  • Adopting ML at scale in your company.


讲师介绍


Jose David Baena,GitHub Senior Software Engineer。


Jose David Baena is a Senior Software Engineer at GitHub. He has more than 10 years experience in backend development, from startups to big companies, from Europe to the United States.


His experience ranges from building distributed low latency systems for financial companies to high performant crawlers for social media.


At the moment, he designs architectures that are used by the Machine Learning and Data Science teams at GitHub. He is passionate about distributed systems, machine learning scalability and developer productivity.












完整演讲 PPT 下载链接


https://archsummit.infoq.cn/2019/beijing/schedule


2020-02-08 18:35453

评论

发布
暂无评论
发现更多内容

架构师训练营第 2 周学习总结

Binary

极客大学架构师训练营

架构师训练营 2 期 Week07 作业

第十周作业 (作业一)

Geek_83908e

架构师一期

「架构师训练营第 1 期」第十一周作业

张国荣

性能优化 - 学习总结笔记

Xuenqlve

架构师训练营 2 期 Week07 总结

第七周作业

hunk

极客大学架构师训练营

第七周-总结

jizhi7

极客大学架构师训练营

第七周大作业

小兵

第十一周作业

wanlinwang

极客大学架构师训练营

架构师训练营week11总结

FG佳

先从哪里开刀-组织形式还是制度安排

luojiahu

组织思考

架构师训练营 1 期 - 第 十一周总结(vaik)

行之

极客大学架构师训练营

【架构师训练营第 1 期 11 周】 学习总结

Bear

极客大学架构师训练营

Week 11 作業

Judyyy

第七周-作业

jizhi7

week02

ルンルン

Spock单元测试框架实战指南二-mock第三方依赖

Java老k

Java 单元测试 JUnit spock

第七周作业总结

hunk

极客大学架构师训练营

架构师训练营 1 期 - 第 十一周作业(vaik)

行之

极客大学架构师训练营

架构师训练营第十一周学习总结

文智

极客大学架构师训练营 架构师一期

【架构师训练营第 1 期 11 周】 作业

Bear

极客大学架构师训练营

Week 11 學習總結

Judyyy

第 7 周作业

Steven

极客大学架构师训练营

架构第十一周作业

Geek_Gu

极客大学架构师训练营

架构第十一周总结

Geek_Gu

极客大学架构师训练营

架构师训练营 - week11 - 作业

lucian

极客大学架构师训练营

架构1期 第十一周作业

haha

极客大学架构师训练营

架构师训练营第十一周作业

文智

极客大学架构师训练营

第二周课后练习

Binary

极客大学架构师训练营

第十一周作业(作业二)

Geek_83908e

架构师一期

GitHub 大规模采用机器学习的痛点和破解之道_ArchSummit_Jose David Baena_InfoQ精选文章