NSF Invests $4 Million in Big Data for Southern United States
Precision medicine and understanding health disparities, innovation to power competitive manufacturing, technology for smarter communities, and addressing coastal hazards such as hurricanes are among the challenges facing the Southern United States. A $4 million award from the National Science Foundation (NSF) will help apply data science and engineering to address those challenges.
The funding will continue support for the South Big Data Innovation Hub, an organization that helps 16 Southern States and the District of Columbia identify and utilize data science and engineering to address critical societal needs. One of four NSF-supported regional data hubs in the U.S., the South Big Data Hub is managed by the Georgia Institute of Technology and the University of North Carolina at Chapel Hill.
“The Big Data Hubs provide a connective tissue for the data science ecosystem across sectors and domains,” said Renata Rawlings-Goss, the Hub’s executive director. “I am deeply pleased by NSF’s recommitment to the growth of the South Hub and our community. Over the last three years, we have made great strides within our priority areas and are looking to broaden that reach in the next four years.”
In the week of July 29-Aug 2, 2019, more than 50 faculty and students from more 21 institutions participated in two R bootcamps at the University of Tennessee at Chattanooga (UTC). The iCompBio REU is supported by NSF Award 1852042, REU Site: ICompBio – Engaging Undergraduates in Interdisciplinary Computing for Biological Research. The first bootcamp on data wrangling using R was taught by Hong Qin, a computational biologist at UTC. Materials for this R Data Wrangling bootcamp is available at a public GitHub repository https://tinyurl.com/UTC-R-camps2019. The second bootcamp, Electronic Health Records, was taught by Elvena Fong and Zhuqi Miao from the Center for Health Systems Innovation at the Oklahoma State University.
From May 27 to August 5, 2019, a group of 12 students participated in a 10-week Interdisciplinary Computing for Biological Research REU program at the University of Tennessee at Chattanooga. These undergraduate researchers are from Fisk University, Tuskegee University, Morehouse College, Norfolk State University, University of Virgin Islands, Tennessee Technological University, Rhodes College, Shippensburg University of Pennsylvania, the University of Tennessee at Chattanooga. The majors of these students include 3 Mathematics, 2 Chemical Engineering, 3 Biology, 1 Biochemistry, 2 Computer Science, and 1 Computer Engineering. The iCompBio19 includes a total of 8 faculty mentors 2019 that come from Computer Science, Mathematics, Biology, Geology, and Chemical Engineering.
All students presented their REU research results at a poster symposium on July 31.
The Program to Empower Partnerships with Industry and Government (PEPI-G) supports faculty members, research scientists, postdocs, and graduate and undergraduate students (rising juniors and seniors as of 2019) from the 16 states that comprise the South Big Data Regional Innovation Hub (South BD Hub).
James Stevenson is an undergraduate student at Northern Kentucky University and is currently pursuing his degree in Information Technology with his focus being Cybersecurity. He’s a technologist at heart and enjoys everything related to cyberinfrastructure, social cybersecurity, the internet of things, and data manipulation. His goals for his senior year of college are to gain professional experience in his career field and to develop his technical skills. This fellowship provided by the Big South Data Hub will allow him to reach these goals.
Rachel St Clair is a doctoral student at Florida Atlantic University studying Complex Systems and Brain Sciences. Rachel’s main focus centers in multi-modal, translational machine learning in complex systems and brain sciences. Her background in both medicine and biology helps structure the integration of machine learning models for both academia and industry applications. Previous work involves a variety of research fields including mental disorder diagnosis, epileptic mice investigations, and synthetic drug detection. Drawing from interdisciplinary experiences drives her current integrative research in deep learning proteomics, computer vision, and therapeutic XR platforms. Her future accomplishments aim to include advancements in advanced machine perception and general AI. Rachel notes, ‘working with others who care deeply for the evolution of computerized cognitive task and their role in making the world a safer place would be a defining historical moment in my career path’.
The Department of Homeland Security – Advanced Research Projects Agency (DHS-ARPA) DA-E lab infrastructure consists of industry-standard servers and network gear, custom appliances built on the premise, and commercial and private cloud capabilities.
DHS’ identified Priority Areas:
Human Trafficking – Examining social media to aid in the fight against human trafficking focusing on Non-Text Data, Automating Search and Scalability
Real-time Analytics for Multi-party, Metro-scale Networks (RAMMMNets) – Data associated with the Internet-of-Things presents challenges to the analytic environments that inform human decision making.
Other Topics – Faculty fellows may propose other research topics for consideration.
With the end of the first grant, Dr. Lea Shanley has stepped down from her role as co-Executive Director of the South Hub. We appreciate her leadership and commitment to the Hub, fostering the All Hub Cyberinfrastructure and Social Cybersecurity Working groups, catalyzing and supporting our Spoke research teams, building numerous collaborative partnerships to benefit this community, and co-authoring the proposal for the next phase of the South BD Hub.
She sends her thanks to the National Science Foundation CISE, the South Hub’s many collaborators and working group co-chairs, the hardworking Executive and Deputy Directors across all four Hubs, and especially to the South Hub team!
Shannon McKeen will join the South Hub leadership team as Deputy Director. Shannon brings 20+ years of experience in strategic planning and university and corporate relations to the Hub team. He holds a BS in computer science from Williams College and an MBA from The Tuck School of Business at Dartmouth.
Thank You to the South Hub Community for an Incredible Meeting.
Contribute to the conversation with colleagues from industry, academia, nonprofits, and government during this two-day meeting to learn more about the accomplishments and future plans for the South Big Data Hub community. Your participation will strengthen our community and identify ways the SBDH can continue to support entities like you.
We invite you and/or your students to come and share your collaborative data science projects, discuss your progress, and highlight your impact
Call for Participants: NSF fundedMultidisciplinary Online Training Program with Stipend Support in Spring 2019on Big Data + High-Performance Computing + Atmospheric Sciences
Funded as an NSF grant to train graduate students, post-docs, and junior faculty on “Big Data + High-Performance Computing + Atmospheric Sciences”, our training program is a new NSF-funded initiative in big data applied to atmospheric sciences and using high-performance computing as a vital tool. The training consists of instruction in the areas of data, computing, and atmospheric sciences supported by teaching assistants, followed by faculty-guided project research in a multidisciplinary team of participants from each area. Participants around the nation will be exposed to multidisciplinary research experiences and have the opportunity for significant career growth. Continue reading →
Faculty teams from the DataUp program during the Instructor Training Workshop on Nov 6 & 7, 2018.
Society is increasingly becoming more data-driven and data-literate. It is vital every institution has the capabilities and infrastructure to engage and develop learners prepared to interact and succeed in such a society. Numerous studies have identified the expanding data divide between institution types and the need to develop successful bridge initiatives. The South Hub begin to address this need by creating a 3-part program, DataUp. Through this program, the South Hub is directly impacting each participating institution’s data science education capacities.
The first component of the program is a hosted 2-day data or software workshop presented by the Carpentries. This provided an opportunity for each participating institution to engage in a workshop that specifically addressed their data knowledge gaps (for more information on these workshops, Click Here). Exposing students to these intensive workshops, students are able to gain hands-on training and exposure to principles and tools, such as shell and JupyterHub. Removing the associated ‘fear factors’ empowers learners to employ and address challenges with data. The second component of the DataUp program is a 2-day pedagogy intensive instructor training.
The DataUp program visited Old Dominion University on Oct 25-26 to introduce shell, git, R, and the JupyterHub. The workshop included students, faculty and staff eager to engage with the analytical tools. This workshop brought together a ‘melting pot’ of faculty, staff, and students from various corners of the university to engage in a 2-day workshop. The concepts were chosen by ODU as Unix Shell & R are utilized in their High-Performance Computing Center. Even though intensive, one student noted, ‘it’s like drinking from an 8-hour firehouse, but the information is great. I knew nothing before Day 1 and [now I feel] more confident [after the first day]’. The 2-person instructor team led the traditional Carpentries curriculum, but their instruction was magnified by multiple ‘workshop helpers’. In training workshops such as these, learners have various expertise levels and learning styles. Multiple ‘workshop helpers’ and instructors reinforced the concepts as the diverse point of views benefited the wide range of learning styles. This was especially beneficial to a professor who noted that she learned to code about 15 years ago and was very nervous to retool herself but, ‘this workshop helped [her] to remove her fear [of coding]’. Continue reading →
Oftentimes, learning and education are discussed from the perspective of students. Unintentionally, excluding the fact that faculty have a natural interest in and curiosity for lifelong learning. Johnson C. Smith University’s DataUp institutional workshop attracted faculty and staff from diverse departments, including Psychology, Mathematics, and Library Services and IT. Over a two-day workshop, attendees discussed shell, git, and the interactive notebook JupyterHub. One learner noted that ‘it is [always] great to attend new workshops…it reminds you how students feel when learning new topics’. Many learners were new to the concepts but were eager to ask questions and find points of connectivity for their respective departments. By the middle of the first day, laughter and collaboration erupted throughout the session as individuals became more open to the concepts and asking questions. Continue reading →
On September 28 – 29, the DataUp program hosted a 2-day workshop at Texas A&M University – Kingsville, a historically Hispanic serving institution in Southern Texas. Arriving on the first day, a sense of eagerness buzzed throughout the classroom. Not only from the learners but also from the hosting faculty members. Faculty member, Dr. Min Zhoung, Assistant Professor of Environmental Engineering, noted that oftentimes learners tend to stray away from analytical or data science opportunities because of a ‘math or computational’ fear. The Carpentries instructors provided engaging and supportive hands-on Python & SQL tutorials and exercises utilizing the shared notebook, JupyterHub. Although there were learners of varying stages of coding knowledge and expertise, numerous students stated their excitement and gratitude for the workshop. One young woman who works in the TAMUK’s President’s office noted that her “projects took considerable time to set up and run, but with the tips from the workshop she can cut her project time down significantly”. The DataUp program’s mission, in part, is to increase capacities for and usage of data science tools. Learners noted that similar workshops moved either too fast or too slow, but this workshop provided the opportunity for all learners to ask questions and learn subtleties to maximize Python and JupyterHub. This workshop was timely for a young man attending a job interview the following week providing him the opportunity to ‘brush up on his verbiage and techniques’. Continue reading →