The following are some of the things LinkedIn has implemented for big data. Workflow automation: Azkaban, an open-source workflow scheduler that runs Hadoop jobs. Extract, transform, load (ETL): collects data from various databases into the Hadoop system. Ad-hoc querying languages: Spark […]