Challenges in cloud infrastructure and scientific data
technical proposals and tools applied to the SciELO database
DOI:
https://doi.org/10.22477/vi.widat.44
Keywords:
Big Data, Data analytics, data lifecycle, information retrieval
Abstract
Introduction: Cloud computing applications are highly relevant to scientific databases built on research data. This work reports the proposed requirements and steps, together with the technologies and tools, for developing cloud computing applications based on research data for the SciELO database. Methods: We used commercial cloud infrastructure and propose a step-by-step method that can be applied to unstructured databases to arrive at a comprehensive data model. Results: We iteratively inspected data entries and developed SQL scripts that build a data model for the SciELO database. All scripts are publicly available. Conclusion: Despite data sovereignty concerns, commercial cloud services are a good option for short-term projects. In addition, the sequence of steps and the general structure of the scripts can be reused to make other similar open databases available and useful.
License
You are free to:
- Share: copy and redistribute the material in any medium or format
- Adapt: remix, transform, and build upon the material for any purpose, even commercially.
The licensor cannot revoke these freedoms as long as you follow the license terms.
