Universitat Politècnica de Catalunya

299 SlideShares, 508 followers, 49 following

Xavier Giro-i-Nieto is an assistant professor at the Universitat Politècnica de Catalunya (UPC). He graduated in Electrical Engineering from ETSETB (UPC) in 2000, after completing his master's thesis on image compression at the Vrije Universiteit in Brussels (VUB) under the direction of Professor Peter Schelkens. In 2001 he worked in the digital television group of Sony Brussels, before returning to Barcelona and joining the Image Processing Group at the UPC. In 2003, he started teaching courses in Electrical Engineering degrees at the EET and ETSETB schools of the UPC. He obtained his PhD on image retrieval in 2012, under the supervision of Professor Ferran Marques from UPC and Profess...

deep learning computer vision recurrent neural networks generative adversarial networks convolutional neural networks visual saliency video processing object detection unsupervised learning natural language processing neural networks video retrieval video generative models medical imaging retrieval multimedia audio multimodal deep learning attention models visual question answering artificial intelligence image classification self-supervised learning gan reinforcement learning imagenet machine learning semantic segmentation video summarization image processing neural machine translation perceptron instance segmentation instance retrieval visualization image segmentation object tracking speech recognition speech synthesis backpropagation adversarial training transfer learning lifelogging clustering eeg figure-ground segmentation sign language interpretability optimization speech dqn deep belief network architectures variational autoencoder affective computing object candidates upc video object segmentation image retrieval cross-modal learning vae policy image captioning lifelong learning vision object segmentation eye fixation wavenet nlp training restricted boltzmann machine face recognition domain adaptation tensorflow egocentric vision video analysis spatial transformer wearable cameras user interaction broadcasters archive search mediaeval saliency crowdsourcing human computing autoregressive models diffusion models deep neural networks gradient descent moderation memes hate speech ai hype annotations 3d reconstruction gnn graph neural networks visual scanpath pixelcnn loss function skip rnn multilayer perceptron incremental learning lstm teaching dataset optical flow nmt speaker identification inception resnet word embeddings autoencoder rbm barcelonatech person retrieval face detection hpc keras backward propagation activity locatization ranking 3d convolution cnn wearables diversity event recognition sentiment prediction search engine classification video 
indexing video annotation barcelona reranking google web toolkit indexing social event detection hierarchical partitions surf computer brain segmentation generative learning genai nist trecvid visual grounding minecraft xai explainable ai representation learning seq2seq q-learning panoptic segmentation autonomous driving moco perpcetron ai for social good mild cognitive impairment dementia visual dialog davis rvos mattnet referring expressions chain rule computational graph social networks resnext automl vgg neural architectures normalizing flows catastrophic forgetting incrmental learning policy gradients value function value markov decision proccess cloud computing sgd adam ealy stopping mini-batch batch normalization cross-entropy mlp point clouds 3d analysis set learning biometrics video segmentation visual localization geometry local iclr2018 dynamic computation self-learning lipreading captioning motion estimation action detection action classification action recognition rework adaptive computation time pixelrnn dbn methodology error function supercomputing gru lip reading softmax logistic regression linear regression active learning interestingness higher education rgbd multiview 3d images depth joint embeddings software t-sne epoch batch visual reasoning astronomy space ethics language cbir deep remote sensing activity recognition soundnet sonorization language model googlenet skip connections deep q-network network in network densenet nin skip thought word2vec data partition vanishing gradient relu catalunya catalonia cgan colorization location retrieval coclustering search engine optimization data augmentation computing theano software development caption natural language memorability eye tracker attributes college outreach aprenentatge automàtic inteligència artificial robots convnet open source alzheimer narrative object endoscopy rapid serial visual presentation electroencephalography brain-computer interfaces ccma relevance feedback lifeblogging 
python social event photo clustering instance search visual descriptors pattern recognition algorithm email image nearest neighbor bundling interest points digital images images web coding 3d streaming media iphone http mysql linux ios crowdmm acmmm etsetb telecom mobilitat erasmus javascript web toolkit wt c++ html web service web interface semantic shots image edge detection image representation professional documentalists video signal processing semimanual solution automatic keyframe selection companies broadcasting single representative keyframe algorithm design and analysis multimedia communication mutual reinforcement algorithm mediavela workshop pixable time series columbia regions phd thesis hyperlinking bag of features signal bci interface twitter television interactive microblogging tv labeling game

Presentations

Crowdsourced Object Segmentation with a Game

Cristina Ruiz Sancho, "Tweet@TV, Social Television in 140 Characters"

Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications in Image Processing"

UPC at MediaEval Hyperlinking 2013

Part-based Object Retrieval with Binary Partition Trees

UPC at MediaEval Social Event Detection 2013

Automatic Keyframe Selection based on Mutual Reinforcement Algorithm

Web Interface for Semi-Automatic Annotation of Semantic Shots

Reranking and Clustering of Images from a Video Search

Interactive Image Processing Demos for the Web

Co-advised Thesis in ETSETB mobility program

Video-on-Demand Service for the iPhone

Photo Clustering of Social Events by Extending PhotoTOC to a Rich Context

Rich Internet Application for Semi-Automatic Annotation of Semantic Shots on Keyframes

Extension of an Image Search Interface to Region-Based Queries

Visual Search with Relevance Feedback Based on Weight Updating

Graphical User Interface for Image-Based Image Search

Content based video summarization into object maps

Bundling interest points for object classification

Low computational cost algorithms for photo clustering and mail signature detection in the cloud

Contextless Object Recognition with Shape-enriched SIFT and Bags of Features

Visual instance mining of news videos using a graph-based approach

Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation

Object segmentation in images using EEG signals

UPC at MediaEval 2014 Social Event Detection Task

Pyxel, a Library for Automatic Photo Annotation

Co-filtering human interaction and object segmentation

Visual Summary of Egocentric Photostreams by Representative Keyframes (BSc Ricard Mestre)

Relevance feedback for image retrieval with EEG signals

Rich Internet Application for Text and Image Queries at the Corporació Catalana de Mitjans Audiovisuals

Likes

The Impact of Segmentation on the Accuracy and Sensitivity of a Melanoma Classifier Based on Skin Lesion Images

Visual Information Retrieval: Advances, Challenges and Opportunities