Navegar Autor

Jesús Soto

Developing an intelligent system for the prediction of soil properties with a portable mid-infrared instrument

biosystemsYa está disponible nuestro nueva publicación.

Developing an intelligent system for the prediction of soil properties with a portable mid-infrared instrument.


•Different machine learning techniques have been tested to predict soil properties.
•The predicted soil properties are TC, TN, CEC, clay, silt and Na+.
•The best predictive machine learning technique has been the Gaussian Process.
•The Gaussian process is better compared to the traditional PLSR technique.
•The Gaussian Process is the candidate for the development of intelligent system.

High-Throughput Infrastructure for Advanced ITS Services

articulo_ITSServicesHoy nos han comunicado la publicación de un nuevo artículo, con el que llevábamos trabajando desde el año pasado.

High-Throughput Infrastructure for Advanced ITS Services: A Case Study on Air Pollution Monitoring.


Novel cooperative intelligent transportation systems (ITS) serve as the basis for the provision of a number of services for drivers, occupants, and third parties. The vast amount of information to be collected, especially in vehicle-to-infrastructure (V2I) communication services, requires new algorithms and hardware platforms to cope with real-time requirements; however, this combination is not properly addressed in the literature. In this paper, we introduce a high-throughput hardware-software infrastructure to gather information from vehicles and efficiently process it to provide novel ITS services. We propose a parallelization approach of a fuzzy clustering technique on heterogeneous servers based on CPU and several GPUs, tailored to classification problems in V2I. The infrastructure is empirically tested to offer a geo-located pollution information service through the periodical collection of both vehicle’s position and status data. We offer a real service that correctly identifies highly polluting traffic areas and drivers. The results indicate a good performance of the system under high loads, and our scalability analysis reveals a good operation in real-ambitious deployments thanks to the use of the both CPU and multiple GPUs, showing that our proposal can efficiently host cooperative services involving high processing in the ITS context.

A novel fuzzy clustering approach to regionalise watersheds

paperHydrology Otra artículo que nos publican: A novel fuzzy clustering approach to regionalise watersheds with an automatic determination of optimal number of clusters. Una nueva aplicación de los algoritmos de clasificación difusa.


One of the most important problems faced in hydrology is the estimation of flood magnitudes and frequencies in ungauged basins. Hydrological regionalisation is used to transfer information from gauged watersheds to ungauged watersheds. However, to obtain reliable results, the watersheds involved must have a similar hydrological behaviour. In this study, two different clustering approaches are used and compared to identify the hydrologically homogeneous regions. Fuzzy C-Means algorithm (FCM), which is widely used for regionalisation studies, needs the calculation of cluster validity indices in order to determine the optimal number of clusters. Fuzzy Minimals algorithm (FM), which presents an advantage compared with others fuzzy clustering algorithms, does not need to know a priori the number of clusters, so cluster validity indices are not used. Regional homogeneity test based on L-moments approach is used to check homogeneity of regions identified by both cluster analysis approaches. The validation of the FM algorithm in deriving homogeneous regions for flood frequency analysis is illustrated through its application to data from the watersheds in Alto Genil (South Spain). According to the results, FM algorithm is recommended for identifying the hydrologically homogeneous regions for regional frequency analysis.

Influence of multivariate modeling in the prediction of soil carbon by a portable infrared sensor

congresokorea Contribución Workshop Proceedings of the 13th International Conference on Intelligent Environments, Seoul, Korea, August 2017.


The determination of carbon is one of the most important in soil analysis. However traditional techniques are costly and time consuming. In this manuscript we propose an alternative predictive approach based on portable mid-infrared spectroscopy data modeled by machine learning techniques. We evaluate the performance of different machine learning models and sample size to predict soil carbon in 457 Australian soils. The results show a good performance of the models. All models are validate by statistical tests. The best performing technique with a 99% of confidence level is the Gaussian Process providing a 98% of accuracy for the prediction of soil carbon. Moreover, this technique is the most robust for the different sample sizes tested. When compared with the commonly used Partial Least Squares Regression technique, the machine learning approaches provide more successful and balanced results.

Big Data con MATLAB

MatlabbigDataInteresante Webinars de cómo MatLab trata grandes conjuntos de datos

Big Data con MATLAB
Paz Tárrega, MathWorks

Descripción general

Según crecen nuestros datos en tamaño y complejidad, se hace más difícil trabajar con ellos, particularmente cuando los datos no caben en memoria. MATLAB ofrece un entorno único para trabajar con big data conviertiendo el análisis y el proceso de big data en fácil, coveniente y escalable.

En este webinar aprenderá estrategias y técnicas para manejar grandes volúmenes de datos en MATLAB. Se muestran las nuevas capacidades de la versión 2016b de MATLAB, incluyendo tall arrays. Utilizando tall arrays puede prescindir de aprender programación orientada a big data o técnicas de manejo de datos fuera de memoria, simplemente utilice el código y la sintaxis que utiliza hasta ahora de MATLAB.

Fuzzy clustering as rational partition method for QSAR

articuloQSARLas técnicas de Fuzzy Clustering podemos aplicarlas en diferentes campos. En este ejemplo tenemos una colaboración que busca mejorar los métodos QSAR, de técnicas computacionales relacionadas con el cálculo de propiedades fisicoquímicas moleculares.


Various methods are used to make the partition of data sets for QSAR development and model validation. In this work we used a fuzzy minimals partitioning and we compare this methodology with another rational partition methods like k-means clustering (KMS) and Minimal Test Set Dissimilarity (MTSD). For the development of QSAR models Ordinary Least Squares (OLS) and Extreme Learning Machine (ELM) methods were used. The generated QSAR equations were validated by the coefficient of determination of the internal leave one out (LOO) cross validation method QLOO 2 and then the coefficient of the external test set Qext 2 was compared between partition methods. The results of this comparison showed that using fuzzy minimal for big and structurally diverse data sets gave an applicability domain similar to KMS and a better predictability models than both methods, KMS and MTSD.

Using SWAT and Fuzzy TOPSIS

paperWaterNos acaban de publicar un artículo: Using SWAT and Fuzzy TOPSIS to Assess the Impact of Climate Change in the Headwaters of the Segura River Basin (SE Spain).


The Segura River Basin is one of the most water-stressed basins in Mediterranean Europe. If we add to the actual situation that most climate change projections forecast important decreases in water resource availability in the Mediterranean region, the situation will become totally unsustainable. This study assessed the impact of climate change in the headwaters of the Segura River Basin using the Soil and Water Assessment Tool (SWAT) with bias-corrected precipitation and temperature data from two Regional Climate Models (RCMs) for the medium term (2041–2070) and the long term (2071–2100) under two emission scenarios (RCP4.5 and RCP8.5). Bias correction was performed using the distribution mapping approach. The fuzzy TOPSIS technique was applied to rank a set of nine GCM–RCM combinations, choosing the climate models with a higher relative closeness. The study results show that the SWAT performed satisfactorily for both calibration (NSE = 0.80) and validation (NSE = 0.77) periods. Comparing the long-term and baseline (1971–2000) periods, precipitation showed a negative trend between 6% and 32%, whereas projected annual mean temperatures demonstrated an estimated increase of 1.5–3.3 °C. Water resources were estimated to experience a decrease of 2%–54%. These findings provide local water management authorities with very useful information in the face of climate change.

Mi contribución se centra en la aplicación de las técnicas de Fuzzy TOPSIS para la selección de los modelos del cambio de clima.

El problema de los acentos en Kubuntu

En la reciente instalación de Kubuntu 16.04 LTS me apareció un problema extraño, no dejaba acentuar las palabras. Después de buscar la solución con el cambio de lenguaje, las configuraciones locales y instalaciones que había que realizar en cada inicio, he topado con la solución: es el problema de la «tilde muerta». Las distribuciones de Español incluyen la posibilidad de utilizar la Variante Español(incluir tilde muerta). Así solucionamos el incordio. Todo lo hacemos desde la ventana

Parallel implementation of fuzzy minimals clustering algorithm



Clustering aims to classify different patterns into groups called clusters. Many algorithms for both hard and fuzzy clustering have been developed to deal with exploratory data analysis in many contexts such as image processing, pattern recognition, etc. However, we are witnessing the era of big data computing where computing resources are becoming the main bottleneck to deal with those large datasets. In this context, sequential algorithms need to be redesigned and even rethought to fully leverage the emergent massively parallel architectures. In this paper, we propose a parallel implementation of the fuzzy minimals clustering algorithm called Parallel Fuzzy Minimal (PFM). Our experimental results reveal linear speed-up of PFM when compared to the sequential counterpart version, keeping very good classification quality.

No entres dócilmente…

interstellar1 Hace unos días he vuelto a ver Interstellar (2014). Sigo considerándola una bella historia de amor. Una de las cosas que he vuelto a disfrutar viéndola es su maravillosa música. Hans Zimmer vuelve a conmovernos con este poema musical que se adapta con armonía a las imágenes.

Junto a la música aparece un poema sencillo y conmovedor: No entres dócilmente en esa buena noche. El poema de Dylan Thomas se ha utilizado en muchas ocasiones, pero aquí se siente partícipe de la historia.

En el minuto 47:38 el profesor John Brand(Michael Caine) recita el principio del poema(en mi modesta opinión la traducción es mejor que las que circulan por internet, el original aquí).

No entres dócilmente en esa buena noche.
La vejez debería delirar y arder cuando se acaba el día.
Rabia, rabia, contra la luz que se esconde.

Aunque el sabio cerca del fin a la tiniebla no haga reproche,
dado que a su verbo ningún rayo ha confiado vigor,
no entra dócilmente en esa buena noche.

Rabia, rabia, contra la luz que se esconde

(En el minuto 1:52:12 el Dr. Hugh Mann(Matt Damon) repite los tres primeros versos)

Por último una infografía muy ilustrativa del tiempo y la física relativista.