Csaba Szepesvári

Ontology type: schema:Person     

Person Info





Publications in SciGraph latest 50 shown

  • 2014 On Learning the Optimal Waiting Time in ALGORITHMIC LEARNING THEORY
  • 2013-06 Alignment based kernel learning with a continuous set of base kernels in MACHINE LEARNING
  • 2012 Partial Monitoring with Side Information in ALGORITHMIC LEARNING THEORY
  • 2012 Invited Talk: Towards Robust Reinforcement Learning Algorithms in RECENT ADVANCES IN REINFORCEMENT LEARNING
  • 2011-12 Model selection in reinforcement learning in MACHINE LEARNING
  • 2011 Editors’ Introduction in ALGORITHMIC LEARNING THEORY
  • 2010 Toward a Classification of Finite Partial-Monitoring Games in ALGORITHMIC LEARNING THEORY
  • 2009-12 Training parsers by inverse reinforcement learning in MACHINE LEARNING
  • 2008-04 Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path in MACHINE LEARNING
  • 2008 Regularized Fitted Q-Iteration: Application to Planning in RECENT ADVANCES IN REINFORCEMENT LEARNING
  • 2008 Active Learning in Multi-armed Bandits in ALGORITHMIC LEARNING THEORY
  • 2008 Active Learning of Group-Structured Environments in ALGORITHMIC LEARNING THEORY
  • 2007 Tuning Bandit Algorithms in Stochastic Environments in ALGORITHMIC LEARNING THEORY
  • 2007 Improved Rates for the Stochastic Continuum-Armed Bandit Problem in LEARNING THEORY
  • 2006-06 Universal parameter optimisation in games based on SPSA in MACHINE LEARNING
  • 2006 RSPSA: Enhanced Parameter Optimization in Games in ADVANCES IN COMPUTER GAMES
  • 2006 Bandit Based Monte-Carlo Planning in MACHINE LEARNING: ECML 2006
  • 2006 Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path in LEARNING THEORY
  • 2004 Margin Maximizing Discriminant Analysis in MACHINE LEARNING: ECML 2004
  • 2004 Enhancing Particle Filters Using Local Likelihood Sampling in COMPUTER VISION - ECCV 2004
  • 2000-06-09 Modular Reinforcement Learning: An Application to a Real Robot Task in LEARNING ROBOTS
  • 2000-03 Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms in MACHINE LEARNING
  • 2000 FlexVoice: A Parametric Approach to High-Quality Speech Synthesis in TEXT, SPEECH AND DIALOGUE
  • 1998-07 Module-Based Reinforcement Learning: Experiments with a Real Robot in AUTONOMOUS ROBOTS
  • 1998-04 Module-Based Reinforcement Learning: Experiments with a Real Robot in MACHINE LEARNING
  • 1998 Automated Detection and Classification of Micro-Calcifications in Mammograms Using Artifical Neural Nets in DIGITAL MAMMOGRAPHY
  • 1998 Performance-Evaluation for Automated Detection of Microcalcifications in Mammograms Using Three Different Film-Digitizers in DIGITAL MAMMOGRAPHY
  • 1997 Learning and exploitation do not conflict under minimax optimality in MACHINE LEARNING: ECML-97
  • 1996 Inverse dynamics controllers for robust control: Consequences for neurocontrollers in ARTIFICIAL NEURAL NETWORKS — ICANN 96
  • 1994 Self-Organized Learning of 3 Dimensions in ICANN ’94
  • 1993 Topology Learning Solved by Extended Objects: A Neural Network Model in ICANN ’93
  • 1993 Integration of Artificial Neural Networks and Dynamic Concepts to an Adaptive and Self-Organizing Agent in ICANN ’93
  • JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "affiliation": [
            "affiliation": {
              "id": "https://www.grid.ac/institutes/grid.17089.37", 
              "type": "Organization"
            "isCurrent": true, 
            "type": "OrganizationRole"
            "id": "https://www.grid.ac/institutes/grid.9008.1", 
            "type": "Organization"
            "id": "https://www.grid.ac/institutes/grid.425196.d", 
            "type": "Organization"
            "id": "https://www.grid.ac/institutes/grid.5018.c", 
            "type": "Organization"
        "familyName": "Szepesv\u00e1ri", 
        "givenName": "Csaba", 
        "id": "sg:person.016202177221.23", 
        "sameAs": [
        "sdDataset": "persons", 
        "sdDatePublished": "2019-03-07T14:02", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        "sdSource": "s3://com-uberresearch-data-dimensions-researchers-20181010/20181011/dim_researchers/base/researchers_1806.json", 
        "type": "Person"

    Download the RDF metadata as:  json-ld nt turtle xml License info


    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/person.016202177221.23'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/person.016202177221.23'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/person.016202177221.23'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/person.016202177221.23'


    Preview window. Press ESC to close (or click here)