- BlingFire lightning fast Finite State machine and REgular expression manipulation library
- spacy NLP course
- Pilosa is an open source, distributed bitmap index that dramatically accelerates queries across multiple, massive data sets.
- Simple, Composable, Open Source ETL - SINGER
- custom but still catalyst optimized spark UDF and UADF
- Frameworks for Machine Learning Model Management
Data links KW 17
some useful & interesting links