Ph.D. Researcher at the Center for Artificial Intelligence (C4AI) at the University of São Paulo, Brazil. Guest Ph.D. Candidate in Computer Science at the University of Twente, the Netherlands. Master's degree in Knowledge Management and Organization (2019). Bachelor in Library Science from the Federal University of Minas Gerais, Brazil (2017), and was an Erasmus student at the Polytechnic Institute of Porto, Portugal (2016-2017). Works on topics related to linked data, controlled vocabularies for the Semantic Web, metadata standards, management of scientific data, the FAIR Data Principles, and other related topics. Domains of application: biodiversity (focused on biological interactions, such as pollination, that can benefit sustainable agriculture somehow), agriculture, agrobiodiversity, and agroecology.

Expertise

  • Earth and Planetary Sciences

    • Citizen
    • Investigation
    • Standard
    • Project
    • Datum
    • Plant
    • Metadata
  • Social Sciences

    • Agriculture

Organisations

One of the main characteristics of the big data paradigm is the unstructured way in which data are published. Big data applications have great potential to be used in digital agriculture, but one may find it difficult to work with unstructured data since most datasets do not provide an accurate description of the data semantics, hampering interpretation and reuse. Consequently, semantic interoperability between datasets may be jeopardized, since the datasets do not share the same metadata concepts in their records. To address that problem, this research project proposes the development of metadata models for the semantic interoperability of datasets in the agricultural domain, focusing on the subdomain of agricultural commodities. To achieve our research objective, the Design Science research principles are adopted as the methodological approach, since it allows the development of research artifacts for the purpose of acquiring knowledge. Two main artifacts are going to be created: 1) a metadata schema for the domain; 2) a metadata semantic mapping among different schemas. The first artifact aims at the publication of agricultural commodities data as semantically structured data, and the second artifact aims at enabling interoperability among datasets by establishing a semantic equivalence between metadata terms used in each dataset. In the last stage of the research, the general applicability of the proposed metadata models will be challenged by applying them in another related field, namely food and nutrition. Finally, the metadata schema will be validated by domain specialists.

Publications

2025

Exploring a Large Language Model for Transforming Taxonomic Data into OWL: Lessons Learned and Implications for Ontology Development (2025)Data Intelligence, 7(2), 265-302. Soares, F. M., Saraiva, A. M., Pires, L. F., Santos, L. O. B. d. S., de Abreu Moreira, D., Corrêa, F. E., Braghetto, K. R., Pignatari Drucker, D. & Botazzo Delbem, A. C.https://doi.org/10.3724/2096-7004.di.2025.0020Exploring a Large Language Model for Transforming Taxonomic Data into OWL: Lessons Learned and Implications for Ontology Development (2025)[Working paper › Preprint]. ArXiv.org. Soares, F. M., Saraiva, A. M., Pires, L. F., Santos, L. O. B. d. S., de Abreu Moreira, D., Corrêa, F. E., Braghetto, K. R., Pignatari Drucker, D. & Botazzo Delbem, A. C.https://doi.org/10.48550/arXiv.2504.18651Supporting Data for "Exploring ChatGPT-4 for Transforming Taxonomic Data into OWL: Lessons Learned and Implications for Ontology Development" (2025)[Dataset Types › Dataset]. Zenodo. Soares, F., Saraiva, A. M., Ferreira Pires, L., Bonino, L., de Abreu Moreira, D., Corrêa, F. E., Braghetto, K. R., Pignatari Drucker, D. & Botazzo Delbem, A. C.https://doi.org/10.5281/zenodo.12684940Taxonomy OWLizer (2025)[Dataset Types › Dataset]. Zenodo. Miranda Soares, F.https://doi.org/10.5281/zenodo.13328561

2024

O design gráfico como ferramenta para a sustentabilidade: uma experiência do projeto pomar urbano (2024)In Anais do XV Congresso Brasileiro de Pesquisa e Desenvolvimento em Design - P&D Design. EDUA. Rangel Silva, R. & Soares, F. M.https://doi.org/10.29327/5457226.1-226Redes e parcerias para abertura de dados agrícolas (2024)In Agricultura digital, agrodados e regulação (pp. 65-76). Embrapa. Correa, F. E., Pignatari Drucker, D., Soares, F. M., Braghetto, K. R., Botazzo Delbem, A. C., Osório, F. S. & Saraiva, A. M.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1170049Towards a Conceptual Model for FAIR Metadata Schemas (2024)In Companion Proceedings of the 43rd International Conference on Conceptual Modeling: ER Forum, Special Topics, Posters and Demos : Co-located with ER 2024 (pp. 42-55) (CEUR Workshop Proceedings; Vol. 3849). CEUR. Soares, F. M., Pires, L. F., Santos, L. O. B. d. S., Calhau, R. F., Moreira dos Santos Maculan, B. C., Coyle, K., Wang, S., Folmer, E., Pignatari Drucker, D., Campos, M. L. d. A., Marcondes, C. H., Almeida, M. B., Braghetto, K. R., Dias, G. A., Salim, J. A., Corrêa, F. E., de Abreu Moreira, D., Botazzo Delbem, A. C. & Saraiva, A. M.https://ceur-ws.org/Vol-3849/forum4.pdfCollectively Working towards Plant-Pollinator Interactions Data Interoperability and Reuse: Lessons Learned from the WorldFAIR Project (2024)Biodiversity Information Science and Standards, 8. Article e141109. Drucker, D., Salim, J., Poelen, J. & Miranda Soares, F.https://doi.org/10.3897/biss.8.141109Scripts for converting CSV data to RDF/Turtle using Python (2024)[Dataset Types › Dataset]. Zenodo. Soares, F.https://doi.org/10.5281/zenodo.13540476The C4AI Knowledge Graph on Agricultural Prices (C4AI-KGAP) (2024)[Dataset Types › Dataset]. Zenodo. Miranda Soares, F.https://doi.org/10.5281/zenodo.13685708

Other contributions

Soares, F. M., Correa, F. E., Pires, L. F., Santos, L. O. B. da S., Drucker, D. P., Braghetto, K. R., et al. (2022). Building a community-based FAIR metadata schema for Brazilian agriculture and livestock trading data. In SEMANTICS 2022 EU: 18th International Conference on Semantic Systems, Vienna, 2022. https://ceur-ws.org/Vol-3235/paper26.pdf 

Soares, F. M., Hamanaka, R. Y., Pontes, T. C. F., Araújo, W. J., & Maculan, B. C. M. dos S. (2022). A aplicação do método de análise de conteúdo na ciência da informação: um estudo preliminar no contexto das teses e dissertações da UFMG. Revista Ibero-Americana De Ciência Da Informação, 15(2), 327–350. https://doi.org/10.26512/rici.v15.n2.2022.36060

Drucker DP, Salim JA, Trekels M, Groom Q, Parr C, Soares FM, Agostini K, Saraiva AM, Molloy L, Hodson S, Gregory A (2022) Plant-pollinator Interaction Data: A case study of the WorldFAIR project. Biodiversity Information Science and Standards 6: e94310. https://doi.org/10.3897/biss.6.94310

José A Salim, Antonio M Saraiva, Paula F Zermoglio, Kayna Agostini, Marina Wolowski, Debora P Drucker, Filipi M Soares, Pedro J Bergamo, Isabela G Varassin, Leandro Freitas, Márcia M Maués, Andre R Rech, Allan K Veiga, Andre L Acosta, Andréa C Araujo, Anselmo Nogueira, Betina Blochtein, Breno M Freitas, Bruno C Albertini, Camila Maia-Silva, Carlos E P Nunes, Carmen S S Pires, Charles F dos Santos, Elisa P Queiroz, Etienne A Cartolano, Favízia F de Oliveira, Felipe W Amorim, Francisco E Fontúrbel, Gleycon V da Silva, Hélder Consolaro, Isabel Alves-dos-Santos, Isabel C Machado, Juliana S Silva, Kátia P Aleixo, Luísa G Carvalheiro, Márcia A Rocca, Mardiore Pinheiro, Michael Hrncir, Nathália S Streher, Patricia A Ferreira, Patricia M C de Albuquerque, Pietro K Maruyama, Rafael C Borges, Tereza C Giannini, Vinícius L G Brito, Data standardization of plant–pollinator interactions, GigaScience, Volume 11, 2022, giac043, https://doi.org/10.1093/gigascience/giac043

Sheina Koffler, Filipi Miranda Soares, Natalia Pirani Ghilardi-Lopes, Bruno Albertini, Debora Pignatari Drucker, José Augusto Salim, Patricia Nunes-Silva, Tiago Mauricio Francoy, Antonio Mauro Saraiva, & Claire Carvell. (2022). FIT Count Brasil: monitoramento de visitantes florais por contagem (p. 90). UFABC. https://doi.org/10.5281/zenodo.6419201

Sheina Koffler, Filipi Miranda Soares, Natália Pirani Ghilardi-Lopes, Bruno Albertini, Debora P Drucker, José Augusto Salim, Patrícia Nunes-Silva, Tiago Mauricio Francoy, Antonio Mauro Saraiva, Claire Carvell. FIT Count Brasil: aplicativo para monitoramento de polinizadores. In: Anais do II Workshop da Rede Brasileira de Ciência Cidadã. Anais...São Paulo(SP) online, RBCC, 2022. https://doi.org/10.29327/175207.1-1 

A. M. SARAIVA, B. C. ALBERTINI, C. ULSEN, D. VIANNA, D. MIRANDA, F. XAVIER, F. M. SOARES, G. R. BORGES, G. MACHADO, J. A. B. GRIMONI, J. OKAMOTO JR, L. ERMILIVITCH, M. L. VEGA, M. V. B. OKUYAMA, M. M. SECKLER, M. R. MAURO, M. C. S. PEREIRA, M. A. SIMPLICIO JUNIOR, N. A. CARMO, R. C. PUGLISI, R. SCARATI, T. C. M. CARVALHO, A. C. B. DELBEM, U. B. MONTEDO, W. F. COSTA, C. PARANAIBA, E. L. COELHO, R. SANTIAGO, T. S. CAMARGO. Planetary Health in Engineering: building a community. In 2022 Planetary Health Annual Meeting, Boston, 2022. https://www.researchgate.net/publication/364806693_Planetary_Health_in_Engineering_building_a_community

F. M. SOARES, S. KOFFLER, N. P. GHILARDI-LOPES, B. ALBERTINI, P. N. SILVA, DEBORA DRUCKER, C. CARVELL, J. CHIAZZESE, J. A. SALIM, T. M. FRANCOY, A. M. SARAIVA. Monitoring pollinators in Brazil: the challenge of adapting the FIT Count citizen science protocol to the Brazilian context. In 2022 Planetary Health Annual Meeting, Boston, 2022. https://www.researchgate.net/publication/364807317_Monitoring_pollinators_in_Brazil_the_challenge_of_adapting_the_FIT_Count_citizen_science_protocol_to_the_Brazilian_context

N. P. GHILARDI-LOPES, E. ZATTARA, K. AGOSTINI, T. M. FRANCOY, F. E. FONTURBEL, B. BLOCHTEIN, S. KOFFLER, F. M. SOARES, C. BARBIERI, A. M. SARAIVA. Citizen science and pollinators of South America. In ECSA Conference 2022: Citizen science for planetary health, Berlin, 2022. https://www.researchgate.net/publication/364806027_Citizen_science_and_pollinators_of_South_America

Koffler, Sheina, Acosta, Andre Luis, Soares, Filipi Miranda, & Saraiva, Antonio Mauro. (2022). 2021 Planetary Health Annual Meeting and Festival Book of Abstracts: Planetary Health for All: Bridging Communities to Achieve the Great Transition (p. 199). Institute of Advanced Studies of the University of São Paulo; Planetary Health Alliance. https://doi.org/10.5281/zenodo.6373367

Carvell, C. & Chiazzese, Jim & Zattara, Eduardo & Fontúrbel, Francisco & Muschett, Giselle & Ghilardi-Lopes, Natalia & Soares, Filipi. (2022). Monitoreo de visitas florales mediante el Conteo Cronometrado de Visitantes Florales (FIT Count). https://doi.org/10.4322/978-65-86819-21-2.s03c14.es

Carvell, C. & Chiazzese, Jim & Zattara, Eduardo & Fontúrbel, Francisco & Muschett, Giselle & Ghilardi-Lopes, Natalia & Soares, Filipi. (2022). Monitoramento da visitação de flores com Contagem Cronometrada de Visitantes Florais (FIT Count). https://doi.org/10.4322/978-65-86819-20-5.s03c14.pt. 

Soares, Filipi Miranda and Hamanaka, Raíssa Yuri Aplicação de metadados na padronização de registros de ocorrência de espécies no contexto da ciência cidadã para a biodiversidade: um estudo de caso., 2021 . In Organização do Conhecimento no Horizonte 2030: Desenvolvimento Sustentável e Saúde / coord. por Carlos Guardado da Silva, Jorge Revez, Luis Corujo, 2021, ISBN 978-989-566-137-4. https://dialnet.unirioja.es/servlet/articulo?codigo=8411215 

Corrêa, Pedro & Brandão, Anarosa & Júnior, Jorge & Almeida, Felipe & Soares, Filipi & Kimura, Leonardo & Tabuti, Lucy & Vellenich, Danton & Raimundo Mendes, Nilton Paulo & de Souza, Douglas & Ohara, Mauro & Badiali, Ana & Leone, Mariza & Carmo, Nilton & Rosa, Suzano & Chavez, Michelet & Matos, Rafael. (2021). Anais do X Workshop de Pós- Graduação de Engenharia da Computação.https://doi.org/10.5281/zenodo.5675699 . 

SOARES, F. M.; HAMANAKA, R. Y.; MACULAN, B. C. M. D. S. Interoperabilidade semântica no contexto de dados da biodiversidade: um estudo de caso sobre a utilização de padrões de metadados. In Encontro Nacional de Pesquisa e Pós-graduação em Ciência da Informação, 21., 2021, Rio de Janeiro. http://hdl.handle.net/20.500.11959/brapci/192460

Salim JA, Zermoglio PF, Drucker DP, Soares FM, Saraiva AM, Agostini K, Freitas L, Wolowski M, Rech AR, Maués MM, Varassin IG (2021) Plant-pollinator Vocabulary - a Contribution to Interaction Data Standardization. Biodiversity Information Science and Standards 5: e75636. https://doi.org/10.3897/biss.5.75636

Soares, F. M., B. C. M. dos S. Maculan, D. P. Drucker, e A. M. Saraiva. “Methodological Principles to Create a Metadata Extension to the Darwin Core Standard for Agrobiodiversity Data”. Brazilian Journal of Information Science: Research Trends, vol. 14, nº 4 - out-dez, dezembro de 2020, p. e020015, doi:10.36311/1940-1640.2020.v14n4.10865.

Soares FM, Saraiva AM, Drucker DP (2020) Linking Agrobiodiversity Data through Metadata Standards. Biodiversity Information Science and Standards 4: e58928. https://doi.org/10.3897/biss.4.58928

MOREIRA, C.; SOARES, F. M. .; HAMANAKA, R. Y.; BUENO, R. V. .; AGANETTE, E. C. . Produtos da Ciência da Informação para o processo de doação de bens permanentes em Instituições Públicas: o caso do projeto Motirõ. Múltiplos Olhares em Ciência da Informação, [S. l.], v. 10, 2020. DOI: 10.35699/2237-6658.2020.20344. Disponível em: https://periodicos.ufmg.br/index.php/moci/article/view/20344.

SOARES , F. M. .; MACULAN , B. C. .; DRUCKER, D. . P. . Padrão de metadado Darwin Core: Proposta de extensão para as interações ecológicas no contexto da agrobiodiversidade. Múltiplos Olhares em Ciência da Informação, [S. l.], v. 9, n. 2, 2020. Disponível em: https://periodicos.ufmg.br/index.php/moci/article/view/19179.

Miranda Soares, F. (2019). Aplicação do software Open Monograph Press para criação de uma biblioteca digital de monografias de uma instituição de ensino superior. LIBERTAS: Revista De Ciênciais Sociais Aplicadas, 9(2), 28-52. Recuperado de https://famigvirtual.com.br/famig-libertas/index.php/libertas/article/view/244

Miranda Soares, F., Coura Moreira dos Santos Maculan, B., & Mendonça Oliveira, L. H. (2019). Criação de tesauro em software especializado. LIBERTAS: Revista De Ciênciais Sociais Aplicadas, 9(1), 174-189. Recuperado de https://famigvirtual.com.br/famig-libertas/index.php/libertas/article/view/239

Hamanaka, Raíssa Yuri, & Soares, Filipi Miranda. (2019). A relação entre o mapeamento de processos e a modelização no contexto da gestão do conhecimento: estudo de caso aplicado em uma biblioteca digital. Investigación bibliotecológica, 33(81), 223-240. Epub 21 de abril de 2020.https://doi.org/10.22201/iibi.24488321xe.2019.81.57997

Soares FM, Maculan BCMS, Drucker D (2019) Darwin Core for Agricultural Biodiversity: A metadata extension proposal. Biodiversity Information Science and Standards 3: e37053. https://doi.org/10.3897/biss.3.37053

Research profiles

Current projects

Pomar Urbano (Urban Orchard)

Urbanization presents substantial social challenges, particularly in emerging countries, including issues such as food scarcity, poverty, deteriorating human health and well-being, air pollution, and biodiversity loss. However, even within large urban areas like São Paulo, there are still ample green spaces where a diverse range of flora can be found. Despite the urban landscape, these pockets of greenery offer valuable opportunities for nature appreciation and conservation, contributing to the overall resilience and livability of the city. Pomar Urbano (Urban Orchard) is a project focused on monitoring fruit-bearing plant species in urban areas throughout Brazil. The main objective of this initiative is to create a comprehensive knowledge base regarding the timing and locations of fruit-bearing plants in Brazilian cities. Many of these plants are situated in public spaces like parks and sidewalks, offering people the opportunity to freely enjoy the benefits they provide. Citizen science has been adopted as the collaborative approach to engage society in species monitoring. The research encompasses a wide range of fruit-bearing plant species, with a particular emphasis on native varieties found in Brazil. To ensure broad coverage, the project spans all 27 state capitals in Brazil, including the Federal District. The data collection process relies on using iNaturalist, a citizen science platform with an advanced Computer Vision model for species identification through uploaded photographs.

WorldFAIR project

Agricultural biodiversity case study

Plant-pollinator interactions are recognized for their key role in ecosystem functioning and sustainable agriculture. However, plant-pollinator data are currently stored in silos across multiple networks and country-specific initiatives. The capacity to integrate those data at regional and global levels is crucial to enable pattern analysis and understanding at biologically-relevant scales. In this context, adoption of community data standards on pollination and good practices is urgently needed. This case study will ensure broad participation and alignment with other agricultural data initiatives in Europe and at the global level to facilitate the implementation of the FAIR data principles. A survey of existing initiatives handling plant-pollinator interaction data will be conducted and an overview of the current status of best practices for plant-pollinator data management will be provided and discussed within the community for improvement. FAIR data assessment rubrics will be adapted for the plant-pollinator domain, to be accompanied by guidelines for their use. At least five agriculture-specific plant-pollination initiatives will serve as pilots for data and digital objects standards adoption. RDA IGAD (Interest Group on Agricultural Data) is leading this effort together with partners in the Biodiversity Information Standards group already involved with developing standards for plant-pollination data in order to advance adoption.

Flower-Insect Timed Count (FIT Count)

FIT Count Brazil

FIT Counts are very simple – you watch a patch of flowers for 10 minutes and count how many insects visit. It is a very useful tool for individuals, community groups and others to measure change in their local biodiversity. If you’ve taken action as part of the All-Ireland Pollinator Plan, carrying out FIT Counts throughout the year and across future years will help track the impact of your actions on insect numbers and diversity. The easiest way to carry out a FIT Count is to use the FIT Count app available from App Store and Google Play.

Safeguarding Pollination Services In A Changing World (SURPASS2)

SURPASS2 is an international partnership between Argentina, Brazil, Chile and the UK, working on pollinators and pollination services in South America. Our objectives are to develop knowledge, build capacity and define tangible actions for conservation and sustainable use of pollinators. SURPASS2 will deliver evidence for the creation of resilient pollination services for sustainable economic growth, positive agricultural and environmental outcomes for improved human health and wellbeing. Through our research activities, we are providing crucial knowledge to food producers, policy-makers, land managers and the public who need better evidence based tools to support decision making for sustainable outcomes. We aim to offer improvements to the future cultural and social recognition of the vital roles that pollinators, and those that work with them, play in sustaining crop production and ecosystem functioning.

Finished projects

Brazil Fifth National Action Plan - Open Government Partnership

Commitment 5: Promote the opening and integration of agricultural value chain databases in accordance with the public interest

The commitment is to engage the government and civil society in the opening and integration of priority databases related to the agricultural value chain, considering the risks, impacts and feasibility of these actions. To fulfill the commitment, government bodies and civil society organizations will participate in a meeting (milestone 1) and carry out an assessment of existing databases (milestone 2). These milestones aim to ensure collaboration between agricultural value chain actors in the opening and integration of databases, understand civil society demands, survey existing initiatives on the theme to avoid duplication of efforts, indicate priority agricultural value chains for opening of data and consolidate, analyze and generate new data and information of strategic interest. These initial actions will serve as preparation for the following actions, which consist of assessing the risks, impacts and feasibility of opening and integrating databases (milestone 3) considering the ethical and responsible reuse of available data, database security and the protection of sensitive data. Milestone 4 involves defining the minimum metadata set needed to ensure data interoperability and transparency. After implementing these actions, priority databases to be opened and made interoperable will be defined (milestone 5).

Scan the QR code or
Download vCard