算法工程师-机器学习-数据科学家面试准备4-ML系统设计
阅读资料
- https://github.com/LongxingTan/Machine-learning-interview
- https://www.youtube.com/playlist?list=PLlvnxKilk3aKx0oFua-HTtFf-d_inQ8Qn
- https://github.com/ByteByteGoHq/ml-bytebytego
- 小红书资料
数据仓库
- used for the analysis and reporting of structured and semi-structured data from multiple data sources, such as point-of-sale transactions, marketing automation, customer relationship management, and more.
- include an analytical database and critical analytical components and procedures
- support ad hoc analysis and custom reporting, such as data pipelines, queries, and business applications
bagging
- Bootstrap aggregating
- reduces variance and helps to avoid overfitting