Data Team
Overview
Modern research projects are vast and complex and incorporate many partner, funder, and publisher requirements. The Data Team works to fill the information gap at Global TIES for Children to develop and use data products for internal and external teams.
-
The Data Team responds to two major needs:
The need for timely and accurate datasets that meet FAIR standards and allow for replication and reuse
The need for deliberate engagement with emerging open science and data security principles and practices
The Data Team presents an opportunity to meet these needs via innovation through application of new technologies, software, and methods (e.g. machine learning). More and more publishers are requiring datasets for publication that require these needs. We strive to have our datasets follow a “TIES” standard and format that allow them to be recognized as a Global TIES for Children public good. We aim to advance the development and practice of these standards in the EiE realm.
-
The Data Team answers these needs by thoroughly integrating into the research lifecycle. We offer a handful of services for research teams to meet these needs:
Pre- and post-award support: Draft, review, and provide feedback on data use agreements, IP terms, and data sharing provisions in contracts
Data collection support: Consult with research teams on KoBo/data collection software programming to reduce manual entry errors, provide guidance on nightly checks, etc.
Dataset production for internal use: Data harmonization, verification, and production of datasets within:
At most 1 month for nightly data for preliminary analysis
At most 3 months for analysis data for publishable analyses
Basic descriptive analyses and reports: Production of descriptive reports (e.g., item and scale descriptives, data visualizations) that can be used internally and shared with partners; can link to psychometric report packages
Active data curation:
Production and dissemination of datasets that meet replication and reuse standards and adhere to FAIR standards
Promotes cross-Center coherence in our internally- and externally-shared data
Internal capacity strengthening:
Centralized information management with off-the-shelf or custom tooling, including the Item Bank for cross-project investigation
Workshops and targeted support on R, Git, data dashboards, citation metrics, etc.
External capacity strengthening: Modules on data standards and database basics
Innovation iteration: Application of data science principles and technologies to research team problem
Research impact: Creating and adapting analytic, predictive modeling, and data visualization tools to measure the impact of TIES publications, datasets, reports and etc in comparison to academic field and SDG benchmarks
Spotlight
NYU-TIES featured at the Research Data and Assess Preservation Virtual Summit 2023
NYU-TIES' Senior Data Associate Daniel Woulfin was one of the panelists on the Data Curation panel at the Research Data and Assess Preservation Virtual Summit 2023, alongside other data curation experts from Penn State University, ICPSR at University of Michigan Institute for Social Research, and the University of Michigan. Daniel gave a lightning talk on his open source web application ,diyddi, which allows researchers to curate their own dataset metadata and generate codebooks and documentation.