perform big data data cleansing