<?xml version="1.0" encoding="UTF-8"?>
<record
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"
    xmlns="http://www.loc.gov/MARC21/slim">

  <leader>04538nab a2200325 i 4500</leader>
  <controlfield tag="003">BR-BrBNA</controlfield>
  <controlfield tag="005">20240523121810.0</controlfield>
  <controlfield tag="008">240523b2022    bl.ar|pooa||| 00| 0 eng |</controlfield>
  <datafield tag="040" ind1=" " ind2=" ">
    <subfield code="a">BR-BrBNA</subfield>
    <subfield code="b">eng</subfield>
  </datafield>
  <datafield tag="072" ind1=" " ind2=" ">
    <subfield code="a">K10</subfield>
  </datafield>
  <datafield tag="072" ind1=" " ind2=" ">
    <subfield code="a">U40</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Carvalho, M&#xF4;nica Canaan </subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Gomide, Lucas Rezende </subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Scolforo, Jos&#xE9; Roberto Soares </subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">P&#xE1;scoa, Kalill Jos&#xE9; Viana da </subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Ara&#xFA;jo, La&#xED;s Almeida </subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Lopes, Is&#xE1;ira Leite e </subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Data mining applied to feature selection methods for aboveground carbon stock modelling</subfield>
  </datafield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">Publica&#xE7;&#xE3;o on-line; 27 ref.; 4 illus.; Summaries (En, Pt)</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">
Abstract &#x2013; The objective of this work was to apply the random forest (RF) algorithm to the modelling of the aboveground carbon (AGC) stock of a tropical forest by testing three feature selection procedures &#x2013; recursive removal and the uniobjective and multiobjective genetic algorithms (GAs). The used database covered 1,007 plots sampled in the Rio Grande watershed, in the state of Minas Gerais state, Brazil, and 114 environmental variables (climatic, edaphic, geographic, terrain, and spectral). The best feature selection strategy &#x2013; RF with multiobjective GA &#x2013; reaches the minor root-square error of 17.75 Mg ha-1 with only four spectral variables &#x2013; normalized difference moisture index, normalized burn ratio 2 correlation texture, treecover, and latent heat flux &#x2013;, which represents a reduction of 96.5% in the size of the database. Feature selection strategies assist in obtaining a better RF performance, by improving the accuracy and reducing the volume of the data. Although the recursive removal and multiobjective GA showed a similar performance as feature selection strategies, the latter presents the smallest subset of variables, with the highest accuracy. The findings of this study highlight the importance of using near infrared, short wavelengths, and derived vegetation indices for the remote-sense-based estimation of AGC. The MODIS products show a significant relationship with the AGC stock and should be further explored by the scientific community for the modelling of this stock.


Index terms: forest management, genetic algorithm, random forest.</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">
Resumo &#x2013; O objetivo deste trabalho foi aplicar o algoritmo &#x201C;random forest&#x201D; (RF) &#xE0; modelagem do estoque de carbono acima do solo (CAS) de uma floresta tropical, por meio da testagem de tr&#xEA;s procedimentos de sele&#xE7;&#xE3;o de vari&#xE1;veis: remo&#xE7;&#xE3;o recursiva e algoritmos gen&#xE9;ticos (AGs) uniobjetivo e multiobjetivo. Os dados utilizados abrangeram 1.007 parcelas amostradas na bacia hidrogr&#xE1;fica do Rio Grande, no estado de Minas Gerais, Brasil, e 114 vari&#xE1;veis ambientais (clim&#xE1;ticas, ed&#xE1;ficas, geogr&#xE1;ficas, de terreno e espectrais). A melhor estrat&#xE9;gia de sele&#xE7;&#xE3;o de vari&#xE1;veis &#x2013; a RF com AG multiobjetivo &#x2013; chega ao menor erro quadr&#xE1;tico de 17,75 Mg ha-1 com apenas quatro vari&#xE1;veis espectrais &#x2013; &#xED;ndice de umidade por diferen&#xE7;a normalizada, textura de correla&#xE7;&#xE3;o do &#xED;ndice de queimada por raz&#xE3;o normalizada 2, cobertura arb&#xF3;rea e fluxo de calor latente &#x2013;, o que representa redu&#xE7;&#xE3;o de 96,5% no tamanho do banco de dados. As estrat&#xE9;gias de sele&#xE7;&#xE3;o de vari&#xE1;veis ajudam a obter melhor desempenho da RF, ao melhorar a acur&#xE1;cia e reduzir o volume dos dados. Embora a remo&#xE7;&#xE3;o recursiva e o AG multiobjetivo mostrem desempenho semelhante como estrat&#xE9;gias de sele&#xE7;&#xE3;o de vari&#xE1;veis, esta &#xFA;ltimo apresenta menor subconjunto de vari&#xE1;veis, com maior precis&#xE3;o. As descobertas deste trabalho destacam a import&#xE2;ncia do uso de infravermelho pr&#xF3;ximo, comprimentos de onda curtos e &#xED;ndices de  vegeta&#xE7;&#xE3;o derivados para a estimativa de CAS baseada em sensoriamento remoto. Os produtos MODIS mostram rela&#xE7;&#xE3;o significativa com o estoque de CAS e precisam ser melhor explorados pela comunidade cient&#xED;fica para a modelagem deste estoque.


Termos para indexa&#xE7;&#xE3;o: manejo florestal, algoritmo gen&#xE9;tico, floresta aleat&#xF3;ria.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2=" ">
    <subfield code="a">FLORESTA TROPICAL</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2=" ">
    <subfield code="a">MANEJO</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2=" ">
    <subfield code="a">SENSORIAMENTO REMOTO</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2=" ">
    <subfield code="a">CARBONO</subfield>
  </datafield>
  <datafield tag="773" ind1="0" ind2=" ">
    <subfield code="0">920</subfield>
    <subfield code="9">25802</subfield>
    <subfield code="d">Bras&#xED;lia-DF Empresa Brasileira de Pesquisa Agropecu&#xE1;ria - EMBRAPA 1966-</subfield>
    <subfield code="o">2023-436358</subfield>
    <subfield code="t">Pesquisa Agropecu&#xE1;ria Brasileira (Brazil)</subfield>
    <subfield code="x">0100-204X</subfield>
    <subfield code="g">v. 57 p. 1-13; (2022)</subfield>
    <subfield code="w">BR2024001081</subfield>
  </datafield>
  <datafield tag="856" ind1=" " ind2=" ">
    <subfield code="u">https://www.scielo.br/j/pab/a/679y4MZ9D5C4gZxZ4M7rGHv/?format=pdf&amp;lang=en</subfield>
  </datafield>
  <datafield tag="942" ind1=" " ind2=" ">
    <subfield code="c">Anal&#xED;tica</subfield>
  </datafield>
  <datafield tag="999" ind1=" " ind2=" ">
    <subfield code="c">299739</subfield>
    <subfield code="d">299739</subfield>
  </datafield>
</record>
