Costa Rica Big Data School 2018

Costa Rica National High Technology Center

December 3-7, 2018

The Costa Rica National Research and Education Network (RedCONARE) and the Costa Rica National High Technology Center (CeNAT) are proud to host the second Costa Rica Big Data School.

The main goal of this event is to train students, teachers, and researchers of the Costa Rica Public Education System in the main Big Data topics: Python, Hadoop, Spark, Data Analytics, Data Visualization, Machine Learning, Deep Learning, and more.

Big Data is a well-known term in computer science due to its wide application and the maturity of the tools and techniques developed in the area. That’s why we decided to host the Big Data School in Costa Rica, free of charge for those involved in the public education system. We hope you enjoy what we have prepared and take full advantage of this exciting event.

Ing. Carlos Gamboa Venegas
RedCONARE Scientific Coordinator
Costa Rica Big Data School Chair

Evaluation

Evaluation form

Speakers

Intel Costa Rica Data Science

Intel Costa Rica Data Science is a group of people who have a common interest in facilitating learning, information sharing, ideation and embedding of Data Science into Intel Costa Rica.

This Data Science group is part of the Intel Costa Rica Communities of Practice (CoP) effort. The Data Science CoP is divided into three main areas: Big Data, Analytics, and Business Intelligence. Members participate by providing training, giving technical talks, and collaborating on projects within Intel on topics related to Data Science.

Instructors

Weijia Xu, Ph.D.

Research Engineer
Manager, Data Mining And Statistics

Biography: Dr. Weijia Xu leads the Data Mining & Statistics group at TACC. Prior to joining TACC, he obtained a master’s degree in Biological Sciences and a doctoral degree in Computer Science from The University of Texas at Austin.

Dr. Xu’s main research interest is in the field of large-scale information management and analysis. The goal of his research is to enable data-driven discoveries by developing new methods and applications that facilitate the data-to-knowledge transfer process. Dr. Xu has extensive experience working with domain scientists on database and analytical method development. He has over thirty peer-reviewed conference and journal publications in similarity-based data retrieval, data analysis, and information visualization with data from various scientific domains.

Charlie Dey, B.A.

Director, Training And Professional Development

Biography: Charlie is the Director of Training and Professional Development with the User Services group at TACC with a background in web development and scientific computing. Charlie’s responsibilities at TACC include organizing, developing content, and building curriculums for TACC’s academic course selection taught in conjunction with several departments at the University of Texas at Austin, as well as for TACC’s professional development and educational training.

Prior to joining TACC, he worked as a Senior Application Developer for the Carle Foundation, and as a computer science instructor at Parkland College in Champaign, IL. He was also a member of a specialized application development team at the University of Illinois and has also been a contracted research consultant for NASA Ames Research Center, studying computational immunology and bioinformatics. Charlie holds a Bachelor’s Degree concentrating in Computer Science and Biology from Eastern Illinois University and certifications in 3D programming and visualization.

Agenda

| Day & Time | Monday 3rd | Tuesday 4th | Wednesday 5th | Thursday 6th | Friday 7th |
| --- | --- | --- | --- | --- | --- |
| 8:00am – 8:30am | Registration |
| 8:30am – 10:00am | Inauguration; Keynote: From Statistics to Data Science, by Way of the Concept of "Big Data" | Intro to TACC System and HPC Basics | From Hadoop to Spark Cluster | Spark Framework and MLlib | Unsupervised Learning Methods |
| 10:00am – 10:30am | Morning Break |
| 10:30am – 12:00pm | Talk: 7 Tips for Integrating Data Science into a Company's Dynamics (Raúl Trejos Espinoza) | Introduction to MapReduce and Hadoop | Big Data Analysis with Spark | Supervised Learning Methods | Deep Learning with Python |
| 12:00pm – 1:00pm | Lunch |
| 1:00pm – 2:30pm | Big Data Fundamentals (Intel Costa Rica Data Science) | Intro to Python Programming I | Advanced Python Programming I | Python Pandas | Containers and Cloud Computing I |
| 2:30pm – 3:00pm | Afternoon Break |
| 3:00pm – 4:30pm | Big Data Fundamentals (Intel Costa Rica Data Science) | Intro to Python Programming II | Advanced Python Programming II | Visualization with Python | Containers and Cloud Computing II |

Material

  • Presentation: From Statistics to Data Science (Dr. Oldermar Rodríguez)
    Shared Folder: http://tinyurl.com/cenat-bigdata2018

Registration

Tuition fee

Participation is free. There are no tuition costs for participants affiliated with CONARE institutions.

Maximum quota

The maximum quota is 60 participants.

Application

The following form must be completed before November 22nd. Accepted participants will be notified by email on Friday, November 23rd.

Important dates

  • Start of the application process to the School: October 31st.
  • Close of the application process to the School: November 22nd.
  • Notification of acceptance or rejection for participation in the School: November 23rd.

Requirements

Be a student, teacher, or researcher at any public university (UCR, TEC, UNA, UNED, UTN), at CONARE, or at any of its affiliated programs: CeNAT, PEN, and SINAES.

We are also admitting officials of the Ministries and Public Entities of the Government of Costa Rica (limited spaces).

Have intermediate English proficiency (reading and listening). All presentations and exercises will be in English.

Have basic programming skills (Python is desired) and basic Linux skills.

Scholarships

The Costa Rica Big Data School offers a scholarship program for students of the public universities of Costa Rica who live outside the Greater Metropolitan Area (GAM).

Application

The following form must be completed before November 19th. Scholarship recipients will be notified by email on November 20th.

Important dates

  • Start of the scholarship application process: October 31st.
  • Close of the scholarship application process: November 18th.
  • Notification of acceptance or rejection of the scholarship: November 20th.

Requirements

  1. Be an active graduate student, with at least one completed year in the major, at any branch campus of the five public universities (UCR, TEC, UNA, UNED, UTN) located outside the Greater Metropolitan Area (GAM).
  2. Have intermediate English proficiency (reading and listening). All presentations and exercises will be in English.
  3. Have basic programming skills (knowledge of R and Python is desired) and basic Linux skills.
  4. Participate in all School activities. If the scholarship holder does not attend all talks and workshops, the Organization may require a refund of the full scholarship amount.
  5. Attach the following documents in a single PDF file no larger than 10 MB. Applications with incomplete documents will not be accepted:
    • Participant’s letter of interest detailing why he/she wants to participate in the School.
    • Letter from the Director or Coordinator of the applicant’s current degree program, stating that the applicant is an active student of that program and university. The letter should specify the campus where the applicant is enrolled.
    • Two recommendation letters from two faculty members.

Organizers

RedCONARE is the Costa Rica National Research and Education Network (NREN). It provides technical infrastructure and communication services such as eduroam, Mconf, LA Referencia, and the Colaboratorio, among others. NRENs, or Advanced Networks, are shared spaces that university research communities around the world use to enhance their knowledge and their contributions to humanity. In Costa Rica, RedCONARE has been positioning itself as a space for research and joint collaboration among its members.

The Advanced Computing Laboratory (CNCA) at the Costa Rica National High Technology Center (CeNAT) is a multidisciplinary space where scientific discovery is accelerated through an advanced computing infrastructure. This infrastructure includes not only specialized and up-to-date hardware, but also a set of efficient applications and well-trained staff who can take full advantage of the technology. This allows CNCA to work in four main areas: research, project development, training, and service provision.

The Texas Advanced Computing Center (TACC) inspires and educates the next generation of computational scientists and technologists and increases the public’s understanding of the roles computing and science play in shaping our society. To educate the next generation of researchers and computational professionals, TACC developed a unique scientific computing curriculum for The University of Texas at Austin.

Sponsors

Location